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About Paris Version 5.2 Change Pages 


Purpose of These Change Pages 

Change pages correct and update a manual. The change pages in this packet provide corrections 
to dictionary entries in the Paris Reference Manual Supplement, Version 5.1. 


What Has Changed? 

The Version 5.2 Paris Release Notes include descriptions of the documentation errors corrected 
by the change pages included in this packet. 


What to Do with These Pages 

By page number, replace the existing pages in the Paris Reference Manual Supplement, Version 
5.1. In each case, simply tear out the existing page and replace it with the new one. 


Placement of Change Pages 

Change Page 

Replace 

Sequence 

pages 

17, 18 

17-18 

43,44 

43-44 


After inserting the change pages, this explanatory page and the title page for this change pages 
packet may be discarded. 
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Contents for Supplement Change Pages 


CHANGE-FIELD-ALIAS . 17 

C-F-C IS. 18 

MAKE-FIELD-AUAS . 43 

F-MOD. 44 
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CHANGE-FIELD-ALIAS 


CHANGE-FIELD-ALIAS 


Changes the referent of the specified field alias. 


Formats CM:change-field-alias alias-id, field-id 

Operands alias-id An alias field-id. This must be an alias field-id returned by 

CM:make-field-alias. It need not be in the current VP set. 

field-id A field-id. This must be a field id returned by CM:allocate-stack- 
field or CM:allocate-heap-field; it may not be an offset into a field. 
The field need not be in the current VP set. 

Context This operation is unconditional. It does not depend on the context-flag. 


The alias field id alias-id is made to reference the field identified by field-id. This function 
allows field aliases to be recycled. 

After a call to CM:change-fieId-alias, the field length and the physical length associated with 
alias-id are exactly what they would be if CM: make-field-alias had been called with field-id . 

An error is signaled if the physical length of the aliased field is not exactly divisible by 
the VP ratio of the VP set to which field-id belongs. (For more on the physical length 
associated with an alias field see the dictionary entry for CM:make-field-alias.) 

The alias field-id can be used in all the same ways as a regular field-id can, with the following 
exceptions: 

• It cannot be passed to CM:deallocate-heap-field. 

• It cannot be passed to CM:deallocate-stack-through. 
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CIS 




C-F-CIS 

Calculates the cosine and sine for the floating-point source field and stores the result in the 
complex destination field. 


Formats CM:c-f-cis-2-lL dest , source , s, e 


Operands dest The complex destination field. 

source The floating-point source field. 

Sj e The significand and exponent lengths for the dest and source fields. 

The total length of the dest field in this format is 2(s + e + l). The 
total length of the source field in this format is s + e + 1. 

Overlap The source field must be either identical to dest, identical to (dest + s+ e + 1), 
or disjoint from dest. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1„ 


Definition For every virtual processor k in the current-vp-set do 
if context-flag{k] = 1 then 
dest[k].real cos source[k] 
dest[k].imag sin source[k] 


The result is a complex number whose real part is the cosine of the source and whose 
imaginary part is the sine of the source . The term cis signifies cos+i sin. 
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MAKE-FIELD-ALIAS 


MAKE-FIELD-ALIAS 


Creates a new field-id that points to an existing field. 


Formats result 


CM: make-field-alias field-id 


Operands field-id A field-id. This must be a field id returned by CMrallocate-stack- 

field or CM:allocate-heap-field; it may not be an offset into a field. 
The field need not be in the current VP set. 


Result A field-id, the alias field-id. This id initially resides in the current VP set. 
Context This operation is unconditional. It does not depend on the context-flag. 


The return value is a field alias. It is a new field-id that identifies the same area of memory 
as does field-id. 

The field identified by field-id can be in a VP set other than the current VP set. The 
returned alias field-id initially resides in the current VP set. The alias field-id can be used 
in all the same ways as a regular field-id can, with the following exceptions: 

• It cannot be passed to CM:deallocate-heap-field. 

• It cannot be passed to CM:deallocate-stack-through. 

Associated with a field alias is a physical length: the number of bits that the field occupies 
in each physical processor. Also associated with a field alias is a field length: the number 
of bits the field occupies in each virtual processor. The physical length is equal to the field 
length multiplied by the VP ratio of the current VP set. It is an error if the physical length 
is not exactly divisible by the VP ratio of the current VP set. 

It is possible for the field length of an alias field to be different from the field length of the 
original field. This is the case when make-field-alias is called on a field in a VP set that has 
a VP ratio different from the VP ratio of the current VP set. Suppose, for example, the 
current VP ratio is 32. If we make an alias for a 32-bit field that resides in a VP set with 
a VP ratio of 1, the resulting alias field is a 1 bit field (in a VP ratio of 32). 


43 





F-MOD 

The residue of one floating-point source value divided by another is placed in the destination 
field. Overflow is also computed. 


Formats CM:f-mod-2-lL dest/sourcel , sourceZ , s, e 

CM:f-mod-3-lL dest y sourcel , sourceZ , s, e 

CM:f-mod-constant-2-lL dest/sourcel , source2-value , s, e 
CM:f-mod-constant-3-lL desf, sourcel , source2-value , s, e 

Operands dest The floating-point destination field. This is the quotient. 

sourcel The floating-point first source field. This is the dividend. 

source2 The floating-point second source field. This is the divisor. 

source2-value A floating-point immediate operand to be used as the second 
source. 

s, e The significand and exponent lengths for the dest, source i, and 

source2 fields. The total length of an operand in this format is 
s + e + 1. 

Overlap The fields sourcel and source2 may overlap in any manner. Each of them, 
however, must be either disjoint from or identical to the dest field. Two 
floating-point fields are identical if they have the same address and the same 
format. It is permissible for all the fields to be identical. 

Flags test-flag is set if division by zero occurs; otherwise it is cleared. 

overflow-flag is set if floating-point overflow occurs; otherwise it is unaffected. 

Context This operation is conditional. The destination and flags may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 
if source2[k] = 0 then 
dest[k] <r- (unpredictable) 
test-flag[k] <— 1 
else 

dest[k] <— sourcel[k] — source2[k] x 
test-flag[k] 0 

if (overflow occurred in processor k) then overflow-flag[k ] 1 
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About Paris Version 5.2 Change Pages 


Purpose of These Change Pages 

Change pages correct and update a manual. The change pages in this packet provide 

■ dictionary entries for Paris instructions new with Version 5.2 

■ dictionary entries for Paris instructions changed with Version 5.2 

■ corrected dictionary entries for Version 5.0 Paris instructions 


What Has Changed? 

The Version 52 Paris Release Notes describe the new and changed features that are documented 
by these pages. The release notes also include descriptions of all the documentation errors cor¬ 
rected by change pages included in this packet. 


What to Do with These Pages 

By page number, insert the change pages into your copy of the Paris Reference Manual, 
Version 5.0. 

Additional Pages 

Any change page with a page number ending in a letter must be added to the exist¬ 
ing manual. Find the page whose number matches the number part of the change 
page number and insert the change page behind it. 

Replacement Pages 

Any change page with a normal page number replaces an existing Paris manual 
page. Tear out the existing page and replace it with the new one. 


Note that many of the replacement pages are included only to preserve the order 
of the Paris dictionary entries. 




About Paris 5.2 Change Pages 


Placement of Change Pages 


Change Page Add after Replace 

Sequence page pages 


45,46 


45-46 

51,52 


51-52 

63,64 


63-64 

83, 84, 85, 86 


83-86 

89, 90, 91, 92 

92a 

92 

89-92 

93, 94, 95, 96, 97, 98 


93-98 

106a 

106 


107,107a, 107b, 107c, 107d, 107e, 107f 

107 

108 


108 

113,113a, 113b, 113c, 114 


113-114 

117,117a, 117b, 118 


117-118 

123, 124 


123-124 

133,134, 135,136 


133-136 

142a, 142b 

143,143a, 143b, 143c, 144 

142 

143-144 

157,157a, 158,159, 160, 161, 162 


157-162 

181,182 


181-182 

185, 186, 187, 188 


185-188 











About Paris 5.2 Change Pages 


iii 


Placement of Change Pages (continued) 


Change Page Add after Replace 

Sequence page pages 


211, 212 

211-212 

271, 272, 273, 274, 275, 276, 277, 278, 

279, 280, 281, 282, 283, 284, 285, 286 

271-286 

301, 302, 303, 304 

301-304 

307, 308, 309,310, 311, 312 

307-312 

340a, 340b, 340c, 340c, 340d 340 

341, 342 

341-342 

373, 374, 375, 376, 377, 378, 379, 380 

373-380 

435, 435a 

435 

449, 449a, 449b, 449c, 450 

449-450 

455, 456 

455-456 


After inserting all change pages, these explanatory pages and the title page for this change pages 
packet may be discarded. 
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Chapter 6. The C/Paris Interface 


inclusion 

One of the values CM_exclusive or CMJnclusive, indicating the boundaries of a scan 
instruction. 

smode 

One of the values CM_none, CM-start_bit, or CM_segment_bit, indicating how a scan 
operation is to be partitioned. 

There are other symbolic values as well, but these are the most important. All names 
are formed by the standard rule: starting from a Lisp name such as :start-bit, add “CM” to 
the front and then convert colons and hyphens to underscores, yielding CM^start_bit. 

6.3 C/Paris Configuration Variables 

The configuration variables provide access to information about the configuration of the 
Connection Machine system. See section 3.6 for a list. The C/Paris interface makes these 
variables accessible through variables declared in the C/Paris header file. They are ini¬ 
tialized in an application program by a call to the subroutine CMJnit and should not be 
changed by an application program. 

Each configuration variable is a numeric value that is constant over the course of a 
session (from one cold boot operation to the next), or varies from one Connection Machine 
configuration to another. For example, CM_physical_processors_limit is a value that depends 
upon the size of the Connection Machine to which the application is attached. 

Numeric values that are constant for a given release of the CM System Software are given 
in ^define statements. 

6.4 Calling Paris from C 

This section describes how to build C programs that access the Paris instruction set using 
the C/Paris interface. Such programs must manage the dynamic allocation and deallocation 
of Connection Machine fields directly. This section describes the form of C main programs 
and subprograms that call the C/Paris interface, as well as the steps involved in compiling 
and linking such programs. 

The following code fragment illustrates the structure of a C main program that calls Paris 
instructions. 

#include <cm/paris.h> 

mainO { 

CM_init(); 


CM_paris_instruction(...); 

if ( CM_configuration_variable > limit ) ... 
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Chapter 6. The C/Paris Interface 



Note that the call to CMJnit is required prior to any other calls to Paris instructions. 
The following code fragment illustrates the structure of a C subroutine subprogram that 
calls Paris instructions. 

#include <cm/paris.h> 
float test() { 

CM_paris_instruction(...); 


if ( CM_configuration_variable > limit ) ... 


} 

It looks exactly like a main program in its use of Paris, except that a subprogram should 
not call CMJnit. 

Use the following command to compile and link these program units: 

'/, cc main.c test.c -Iparis ~lm 

Note that there should be no space between the -i option and its argument. 
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Chapter 7. The Fortran/Paris Interface 


srnode 

One of the values CM_none, CM_start_bit, or CM_segment_bit, indicating how a scan 
operation is to be partitioned. 

There are other symbolic values as well, but these are the most important. All names 
are formed by the standard rule: starting from a Lisp name such as : start-bit, add “CM” to 
the front and then convert colons and hyphens to underscores, yielding CMjstart_bit. 

7.3 Fortran/Paris Configuration Variables 

The configuration variables provide access to information about the configuration of the 
Connection Machine system. See section 3.6 for a list. The Fortran/Paris interface makes 
these variables accessible through variables declared in the common block named cmval, 
defined by the Fortran/Paris header file. They are initialized in an application program by 
a call to the subroutine CM Jnit and should not be changed by an application program. 

Each configuration variable is a numeric value that is constant over the course of a 
session (from one cold boot operation to the next), or varies from one Connection Machine 
configuration to another. For example, CM_physical_processorsJimit is a value that depends 
upon the size of the Connection Machine to which the application is attached. Most of 
these configuration variables are declared to be of Fortran type INTEGER. 

Numeric values that are constant for a given release of the CM System Software are also 
given in PARAMETER statements. 

7.4 Calling Paris from Fortran 

This section describes how to build Fortran programs that access the Paris instruction set 
using the Fortran/Paris interface. Such programs must manage the dynamic allocation 
and deallocation of Connection Machine fields directly. This section describes the form of 
Fortran main programs and subprograms that call the Fortran/Paris interface, as well as 
the steps involved in compiling and linking such programs. 

The following code fragment illustrates the structure of a Fortran main program that 
calls Paris instructions. 

PROGRAM main 

C VAX Fortran or Sun Fortran 

INCLUDE */usr/include/cm/paris-configuration-fort.h’ 

CALL CM.initO 

CALL CM_paris_instruction(...) 

IF ( CM_configuration_variable .GT. limit ) ... 

END 
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Chapter 7. The Fortran/Paris Interface 


Note that the call to CMJnit is required prior to any other calls to Paris instructions. 
The following code fragment illustrates the structure of a Fortran subroutine subprogram 
that calls Paris instructions. 

SUBROUTINE test 

C VAX Fortran or Sun Fortran 

INCLUDE ’/usr/include/cm/paris-configuration-fort„h’ 

CALL CM_paris_instruction(...) 

IF ( CH.configuration.variable .GT. limit ) ... 

END 

It looks exactly like a main program in its use of Paris, except that a subprogram should 
not call CMJnit. 

Using VAX Fortran, the following command compiles and link s these program units to 
ran on the Connection Machine Model 2: 

7. fort main.for test.for -Iparisfort -lparis 

Note that there should be no space between the -I option and its argument. 

Using Sun Fortran, the following command compiles and links these program units to 
ran on the Connection Machine Model 2: 

7, f77 main.f test.f -Iparisfort -lparis 

Note that there should be no space between the -I option and its argument. 
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ABS 


F-ABS 

Computes, in each selected processor, the absolute value of a floating-point source field and 
stores it in the destination field. 


Formats CM:f-abs-l-lL dest! source, s , e 
CM:f-abs-2-lL dest, source , s, e 


Operands dest 

source 
s, e 


The floating-point destination field. 

The floating-point source field. 

The significand and exponent lengths for the dest and source fields. 
The total length of an operand in this format is s + e + 1. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two floating-point fields are identical if they have the same address and the 
same format. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

if source[k] > 0 then dest[k] <— source[k] 
else dest[k] < - source[k ] 


The absolute value of the source operand is placed in the dest operand. 

For floating-point numbers, absolute value is calculated by changing the sign bit to 0 (pos¬ 
itive). All other bits in the number are unchanged. As a result, the absolute values of 
negative infinities, denormalized numbers, and NaN’s are their positive counterparts. 
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ABS 


S-ABS 

Computes the absolute value of a signed integer source field and stores it in the destination 
field. 


Formats CM:s-abs-l-lL dest/source, len 
CM:s-abs-2-lL dest , source , len 
CM:s-abs-2-2L dest, source, dlen, slen 

Operands dest The signed integer destination field. 

source The signed integer source field. 

len The length of the dest and source fields. This must be no smaller 

than 2 but no greater than CM: *maximum-integer-length*. 

dlen The length of the dest field. This must be no smaller than 2 but 

no greater than CM:*maximum-integer-length*. 

slen The length of the source field. This must be no smaller than 2 but 

no greater than CM:*maximum-integer-length*. 

Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 

Flags overflow-flag is set if the result cannot be represented in the destination field; 

otherwise it is cleared. 

Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

if source[k ] > 0 then dest[k] <— source[k] 
else dest[k] <- source[k] 

if (overflow occurred in processor k) then overflow-flag[k] *— 1 
else overflow-fla^k] *— 0 

The absolute value of the source operand is placed in the dest operand. (If the length of 
the dest field equals the length n of the source field, overflow can occur only if the source 
field contains — 2 n . If the length of the dest field is greater than the length of the source 
field, then overflow cannot occur.) 
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ALLOCATE-HEAP-FIELD-VP-SET 


ALLOCATE-HEAP-FIELD-VP-SET 

Allocates a new heap field of the specified length in the specified VP set and returns a 
unique identifier. 


Formats 

Operands 

Result 

Context 


result «— CM :allocate-heap-field-vp-set Zen, vp-set-id 

len An unsigned integer, the length in bits of the field to be allocated. 

vp-set-id A vp-set-id. 

An unsigned integer, the new field-id. 

This operation is unconditional. It does not depend on the context-flag. 


A new field of length len is allocated on the heap within the specified VP set. A field-id 
for the newly created field is returned. 
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ALLOCATE-STACK-FIELD 


ALLOCATE-STACK-FIELD 

Allocates a new stack field of specified length in the current VP set and returns a unique 
identifier. 


Formats 

Operands 

Result 

Context 


result <— CM:allocate-stack-field len 

len An unsigned integer, the length, in bits, of the field to be allocated. 

An unsigned integer, the new field-id. 

This operation is unconditional. It does not depend on the context-flag. 


A new field of length len is allocated on the stack within the current VP set. A field-id for 
the newly created field is returned. 
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ALLOCATE-STACK-FIELD-VP-SET 


ALLOCATE-STACK-FIELD-VP-SET 

Allocates a new stack field of the specified length in the specified VP set and returns a 
unique identifier. 


Formats 

Operands 

Result 

Context 


result *— CM:allocate-stack-field-vp-set len, vp-set-id 

len An unsigned integer, the length in bits of the field to be allocated. 

vp-set-id A vp-set-id. 

An unsigned integer, the new field-id. 

This operation is unconditional. It does not depend on the context-flag. 


A new field of length len is allocated on the stack within the specified VP set. A field-id 
for the newly created field is returned. 
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ALLOCATE-VP-SET 




ALLOCATE-VP-SET 


Create a new VP set, within which fields may be allocated. 


Formats 

Operands 

Result 

Context 


result <— CM:allocate-vp-set geometry-id. 
geometry-id A geometry-id. 

A vp-set-id, identifying the newly allocated VP set. 

This operation is unconditional. It does not depend on the context-flag. 


This operation returns a vp-set-id for a newly created VP set. This may be given to other 
Paris operations in order to create memory fields in which data may be stored. The size 
and shape of the VP set is determined by the geometry specified by the geometry-id. It is 
possible to alter the geometry later (by using CM: set-vp-set-geometry), but the total number 
of virtual processors in the VP set remains forever fixed. 
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AREF32 


AREF32 

Fetches array elements specified by a per-processor index and copies them to a fixed desti¬ 
nation. The array is stored in a special format that allows fast access. 

Formats CM:aref32-2L dest, array , index , dlen, index-len, index-limit 

CM:aref32-always-2L dest, array, index, dlen, index-len , index-limit 

Operands dest The destination field. 

array The source array field. This must contain data stored in a special 
format by either CM:aset32 or CM:transpose32. 

index The unsigned integer index field. This is used as the per-processor 
index into the array. 

dlen The length of the dest field. This must be non-negative and no 

greater than CM:*maximum-integer-length*. This is taken as the 
array element length and must be a multiple of 32. 

index-len The length of the index field. This must be non-negative and no 
greater than CM:*maximum-integer-length*. 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index . This is taken as the array 
extent. 

Overlap The fields array and index may overlap in any manner. However, the array 
and index fields must not overlap the dest field. 

Context The non-always operations are conditional. The destination may be altered 
only in processors whose context-flag is 1. 

The always operations are unconditional. The destination may be altered 
regardless of the value of the context-flag. 

Definition For every virtual processor k in the current-vp-set do 
if (always or context-flag[k] = 1) then 
if index[k] < index-limit then 

let r = geometry-total-vp-ratio(geometry(current-vp-set)) 
let m = mod 32 
let i = index[k] 

for all j such that 0 < j < dlen do 

dest[k](j) array[k - m x r + (j mod 32) X r](32 X ( i + [^rj)) 

else 

(error) 
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This is a simple form of array reference for parallel arrays whose elements are stored 
across the memory of individual processors. To each processor belongs an array of extent 
index-limit with elements of length dlen . 


The array element indexed by each active processor is copied into the dest field of that 
processor. Different processors may reference different elements of their arrays. For this 
reason, this form of array referencing is known as indirect addressing . 


Each processor has an array index stored in the field index . This is used to index into an 
area of CM memory, array , whose allocated length in bits should be at least 


^ index-limit x 


~ dlen'W 
~32~ ) 


X 32 


The argument index-limit is one greater than the largest allowed value of the index. It is 
an error for any index value to equal or exceed this limit. 

A field of length d/era, and starting at address array + ix 32, where i is the the unsigned 
number stored at index , is copied to dest in all selected processors. Even this is not quite 
accurate, because the array data is not organized in the same manner as for CMraref. Instead, 
it is organized in a peculiar way for fast per-processor access. Parallel arrays stored in this 
format are termed slicewise parallel arrays . 

Slicewise parallel array data is arranged with successive bits stored in successive processors 
within groups of 32 virtual processors. Thus, slicewise array data belonging to one processor 
is spread over the memories of the 32 processors in its group and the memory of each 
processor holds data belonging to all 32 processors. 

A region of memory set aside for a slicewise array of the format required by CM: aref32 should 
be accessed only through the operations CM:aset32 and CM:aref32, related operations such 
as CM:get-aref32 and CM:send-aset32-overwrite, or operations that copy the array as a whole 
from all processors (such as I/O operations). It is also possible to operate on this memory 
in blocks of 32-bit square matrices with the CM:transpose32 instruction. 
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AREF32-SHARED 

Fetches an array element specified by a per-processor index and copies it to a fixed destina¬ 
tion. The source array is stored in a special format that allows fast access, and is accessed 
in such a way that all the virtual processors within a group of 32 physical processors share 
the same array. 

Formats CM:aref32-shared-2L dest, array , index, dlen, index-len, index-limit 

CM:aref32-shared-always-2L dest, array, index, dZen, index-len, index-limit 

Operands dest The destination field. 

array The source array field. This must be a contiguous region in CM 
memory. It need not be in the current VP set. 

index The unsigned integer index field. This is used as the per-processor 
index into array. 

dlen The length of the dest field. This must be non-negative and 

no greater than CM:♦maximum-integer-length 5 **. This is normally 
taken as the array element length and must be a multiple of 32. 
As a special case, dlen may be 8 or 16 and, if so, access into both 
the source and the destination fields is offset appropriately. 

index-len The length of the index field. This must be non-negative and no 
greater than CM:♦maximum-integer-length 5 * 8 . 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index. This is taken as the extent 
of array. 

Overlap The fields array and index may overlap in any manner. However, the array 
and index fields must not overlap the dest field. 

Context The non-always operations are conditional. The destination may be altered 
only in processors whose context-flag is 1. 

The always operations are unconditional. The destination may be altered 
regardless of the value of the context-flag. 

Definition For every virtual processor k in the current-vp-set do 
if (always or context-flag[k] = 1) then 
if index[k] < index-limit then 

for all j such that 0 < j < dlen do 
dest[k](j) 
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array [32 + (j mod 32)J (index-limit + index[k 

else 

(error) 

where r is the VP ratio, and where j is the bit position in each field. 


This is a simple form of array reference for arrays whose elements are stored across the 
memory of individual processors and accessed in such a way that many processors appear 
to share a single array of extent index-limit with elements of length dlen . 

The shared array element (or a portion of it) indexed is copied into desi in all (selected) 
processors. Different processors may access different elements of the shared array. For this 
reason, this form of array referencing is known as indirect addressing. 

Each processor has an array index stored in the field index. This is used to index into array. 
The argument index-limit is one greater than the largest allowed value of the index. It is 
an error for any index value to equal or exceed this limit. 

The data within the source array area is not organized in the same manner as for CM:aref; 
instead, it is organized in a peculiar way for fast per-processor access. Shared arrays stored 
in this format are termed slicewise shared arrays. 

Slicewise shared array data is arranged with successive bits stored in successive processors, 
within groups of 32 physical processors. Each 32-bit word of each element is stored sepa¬ 
rately in processor memories, as follows: The low-order 32 bits of all elements are grouped 
together across processor memories in a field of length 32 X index-limit bits. Similarly, the 
next 32 bits of all elements are grouped together, and so on, up to the high-order bits of all 
array elements. This data format allows fast hardware-supported access to the individual 
elements of a shared array. 

A region of memory set aside for an array of the format required by CM:aref32-shared must 
be contiguous in memory. It must therefore be allocated all at once, at a VP ratio of 1, 
with a single call to CM:allocate-stack-field or to CM:allocate-heap-fie!d. Alternatively, from 
Lisp, the memory may be allocated within a with-stack-field form at a VP ratio of 1. 

The area of CM memory occupied by array should be allocated at a VP ratio of 1 as a 
field whose length in bits is exactly 


index-limit X dlen 


Shared array memory should be accessed only with the operations CM:aref32-shared and 
CM:aset32-shared, or with operations that copy the array as a whole from all processors 
(such as I/O operations). Data in such a region of memory may, however, be reoriented 
with the CM:tran$pose32 instruction. 
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As a special case, if the dlen argument is specified as 8 or 16, then each processor accesses 
one byte or one half-word of a 32-bit element. The index-limit argument must be specified 
as the extent of the array when considered to contain 32-bit elements. Nonetheless, valid 
index values are integers 0 through 2 or 4 times this index-limit. The index argument may 
be thought of as consisting of two fields, one that indexes a 32-bit array element and one 
that indexes an 8- or 16-bit offset into that element. To index bytes, the low 2 bits of index 
specify the offset. To index half-words, the low 1 bit of index specifies the offset. 
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ASET 

Stores into an array element specified by a per-processor index a value copied from a fixed 
source field. 


Formats 

Operands 


Overlap 


Flags 


Context 


CM:aset-2L source , array, index , s/en, index-len , index-limit , element-len 

The source field. 

The destination array field. 

The unsigned integer index into the array field. 

The length of the dest field. This must be non-negative and no 


source 

array 

index 

slen 


greater than CM:*maximum-integer-length*. 

index-len The length of the index field. This must be non-negative and no 
greater than CM:*maximum-integer-length*. 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index . 

element-len An unsigned integer immediate operand to be used as the 
length of an array element. 

The fields source and index may overlap in any manner. However, the source 

and index fields must not overlap the array field. 

test-flag is set if the value in the index field is less than the index-limit ; 

otherwise it is cleared. 

This operation is conditional. The destination and flag may be altered only 

in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ — 1 then 

if index[k] < index-limit then 
let p = index [k] X element-len 
array [&] (p : p + slen — 1) <— source [A] 
test-flag[k] «— 1 
else 

test-flag[k] 0 


This is a simple form of array modification, for arrays stored in the memory of individual 
processors. Each processor has an array index stored in the field index . This is used to 
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index into an array, whose length in bits shotdd be index-limit x element-len. The source 
field is copied into the element indexed (or a portion of it) in all selected processors. Thus 
different processors may modify different elements of their arrays. 

More precisely, the source field is copied to a field of length slen and starting at address 
array + i x element-len, where i is the unsigned number stored at index, in all selected 
processors. 

The argument index-limit is one greater than the largest allowed value of the index. Those 
processors that have index values greater than or equal to index-limit do not alter the value 
of the destination field; they also clear test-flag. All processors in which the index field is less 
than index-limit set test-flag. The argument element-len is the length of individual elements 
of the array. Usually this will be the same as dest-length, but for certain applications it 
is worthwhile for it to differ. For example, within an array of 128-bit records one may 
store into just one 16-bit component of an indexed record by letting slen be 32, letting 
element-len be 128, and by offsetting the array address by the offset within each record of 
the 16-bit quantity to be modified. As another example, to modify a 4-character substring 
of a string of 8-bit characters, one may let slen be 32 and element-len be 8. 
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ASET32 

Copies data from a fixed source to the destination array elements specified by a per-processor 
index. The destination array is stored in a special format that allows fast access. 

Formats CM:aset32-2L source , array, index, sZen, index-len, index-limit 

Operands source The source field. 

array The destination array field. 

index The unsigned integer index field. This is used as the per-processor 
index into array. 

slen The length of the source field. This must be non-negative and no 

greater than CM:*maximum-integer-length*. This is taken as the 
array element length and must be a multiple of 32. 

index-len The length of the index field. This must be non-negative and no 
greater than CM:♦maximum-integer-length*. 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index. This is taken as the array 
extent. 

Overlap The fields source and index may overlap in any manner. However, the source 
and index fields must not overlap the array field. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 

Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 

if index [fe] < index-limit then 

let r = geometry-total-vp-ratio(geometry(current-vp-set)) 
let m = |^J mod 32 
let i = index [k] 

for all j such that 0 < j < slen do 

array[k — mx r + (j mod 32) X r](32 x ( i + [^J)) source[k](j) 

else 

(error) 

This is a simple form of array modification for parallel arrays whose elements are stored 
across the memory of individual processors. To each processor belongs an array of extent 
index-limit with elements of length slen. 
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The source field value for each active processor is copied into the indexed array element 
belonging to that processor. Thus different processors may modify different elements of 
their arrays. For this reason, this form of array access is known as indirect addressing . 


Each processor has an array index stored in the field index . This is used to index into an 
area of CM memory, array , whose allocated length in bits should be at least 


(index-limit x 



X 32 


The argument index-limit is one greater than the largest allowed value of the index. It is 
an error for any index value to equal or exceed this limit. 

In all selected processors, the source field is copied to a field of length slen and starting at 
address array + i X 32, where i is the the unsigned number stored at index . Even this is not 
quite accurate, because the data within the destination array area is not organized in the 
same manner as for CM:aset. Instead, it is organized in a peculiar way for fast per-processor 
access. Parallel arrays stored in this format are termed slicewise parallel arrays . 

Slicewise parallel array data is arranged with successive bits stored in successive processors 
within groups of 32 virtual processors. Thus, slicewise array data belonging to one processor 
is spread over the memories of the 32 processors in its group and the memory of each 
processor holds data belonging to all 32 processors. 

A region of memory set aside for a slicewise array of the format required by CM: aset32 should 
be accessed only through the operations CM:aref32 and CM:aset32, related operations such 
as CM:send-aset32-overwrite and CM:get-aref32, or operations that copy the array as a whole 
from all processors (such as I/O operations). It is also possible to operate on this memory 
in blocks of 32-bit square matrices with the CM:transpose32 instruction. 
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ASET32-SHARED 

Copies data from a fixed source to the destination array elements specified by a per-processor 
index. The array is stored in a special format that allows fast access, and is accessed in 
such a way that all the virtual processors within a group of 32 physical processors share the 
same array. 

Formats CM:aset32-shared-2L source , array , index , sZera, index-len, index-limit 

Operands source The source field. 

array The destination array field. This must be contiguous region in 
CM memory. It need not be in the current VP set. 

index The unsigned integer index field. This is used as the per-processor 
index into the array. 

slen The length of the source field. This must be non-negative and 

no greater than CM:*maximum-integer-length*. This must be a 
multiple of 32 and is taken as the array element length. 

index-len The length of the index field. This must be non-negative and no 
greater than CM:*maximum-integer-Iength*. 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index. This is taken as the extent 
of array. 

Overlap The fields source and index may overlap in any manner. However, the source 
and index fields must not overlap the array field. 

Context This operation is conditional, but whether data is copied depends only on the 
context-flag of the originating processor; the message, once transmitted to the 
receiving processor, is stored into the the field indicated by array regardless 
of the context-flag of the receiving processor. 

Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 

if index[k] < index-limit then 

for all j such that 0 < j < dlen do 

array 32)j (index-limit |^J + index[k 

else 

(error) 

where r is the VP ratio, and where j is the bit position in each field. 
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For any two active virtual processors, k and k\ if index[k] = index[k% then 
either source[k ] or source[k f ] is stored in dest, depending upon the implemen¬ 
tation. 


This is a simple form of array modification for arrays whose elements are stored across the 
memory of individual processors and accessed in such a way that many processors appear 
to share a single array of extent index-limit with elements of length slen. 

The source field in each selected processor is copied into the array element (or a portion of 
it) indexed. Different processors may modify different elements of the shared array. For this 
reason, this form of array referencing is known as indirect addressing . If several processors 
sharing the same array attempt to modify the same element in a single CM:aset32-shared 
operation, then one of the values is stored and the rest are discarded. 

Each processor has an array index stored in the field index . This is used to index into array . 
The argument index-limit is one greater than the largest allowed value of the index. It is 
an error for any index value to equal or exceed this limit. 

The data within the destination array area is not organized in the same manner as for 
CM:aset; instead, it is organized in a peculiar way for fast per-processor access. Shared 
arrays stored in this format are termed slicewise shared arrays . 

Slicewise shared array data is arranged with successive bits stored in successive processors, 
within groups of 32 physical processors. Each 32-bit word of each element is stored sepa¬ 
rately in processor memories, as follows: The low-order 32 bits of all elements are grouped 
together across processor memories in a field of length 32 X index-limit bits. Similarly, the 
next 32 bits of all elements axe grouped together, and so on, up to the high-order bits of all 
array elements. This data format allows fast hardware-supported access to the individual 
elements of a shared array. 

A region of memory set aside for an array of the format required by CM:aset32-shared must 
be contiguous in memory. It must therefore be allocated all at once, at a VP ratio of 1, 
with a single call to CM:aliocate-stack-field or to CM:allocate-heap-field. Alternatively, from 
Lisp, the memory may be allocated within a with-stack-field form at a VP ratio of 1. 

An area of CM memory occupied by array should be allocated at a VP ratio of 1 as a field 
whose length in bits is exactly 


index-limit x dlen 


Shared array memory should be accessed only with the operations CM:aref32-shared and 
CM:aset32-shared, or with operations that copy the array as a whole from all processors 
(such as I/O operations). Data in such a region of memory may, however, be reoriented 
with the CM:transpose32 instruction. 
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AVAILABLE-MEMORY 

Determines the number of bits of memory, per virtual processor, that remain available for 
allocation on either the heap or the stack. 

Formats result <— CM: available-memory 

Result An unsigned integer, the number of bits available. 

Context This operation is unconditional. It does not depend on the context-flag. 

The number of bits available for allocation by either CM:allocate-heap-field or CM:allocate- 
stack-field is returned to the front end as an integer. The return value represents the number 
of bits available for each virtual processor in the current VP set. 
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F-F-CEILING 

Determines the smallest integral value that is not less than the floating-point source field 
value in each selected processor and stores it in the floating-point destination field. 


Formats CM:f-f-cei!ing-l-lL dest/source, s, e 
CM:f-f-ceiIing-2-lL dest, source, s, e 


Operands dest 
source 
s, e 


The floating-point destination field. 

The floating-point source field. 

The significand and exponent lengths for the dest and source fields. 
The total length of an operand in this format is s + e + 1. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two floating-point fields are identical if they have the same address and the 
same format. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current’vp~ set do 
if context’flag[k\ = 1 then 
dest[k] <— \source[k ]] 


The source field, treated as a floating-point number, is rounded to the nearest integer in 
the direction of +oo, which is stored into the dest field as a floating-point-number. 

Note that overflow cannot occur. 
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S-CEILING 

The ceiling of the quotient of two signed integer source values is placed in the destination 
field. Overflow is also computed. 


Formats CM:s-ceiling-3-3L dest, sourcel , sourceZ, dlen , slenf slen2 


Operands dest 

sourcel 

sourceZ 

dlen 

slenl 

slen2 


The signed integer quotient field. 

The signed integer dividend field. 

The signed integer divisor field. 

For CM:s-ceiling-3-3L, the length of the dest field. This must be no 
smaller than 2 but no greater than CM: *maximum-integer-length*. 

For CM:s-ceiling-3-3L, the length of the sourcel field. This must 
be no smaller than 2 but no greater than CM:♦maximum-integer- 
length*. 

For CM:s-ceiling-3-3L, the length of the source2 field. This must 
be no smaller than 2 but no greater than CM:♦maximum-integer- 
length*. 


Overlap The fields sourcel and source2 may overlap in any manner. Each of them, 
however, must be either disjoint from or identical to the dest field. Two integer 
fields are identical if they have the same address and the same length. It is 
permissible for all the fields to be identical. 


Flags overflow-flag is set if the quotient cannot be represented in the destination 

field; otherwise it is cleared. 

test-flag is set if the divisor is zero; otherwise it is cleared. 

Context This operation is conditional. The destination and flags may be altered only 
in processors whose context-flag is 1. 


Definition 


For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

dest[k] « 


sourcel 

k 

source2 

T 


if (overflow occurred in processor k) then overflow-flag[k ] 
else overflow-flacfk ] <— 0 
if source2[k ] = 0 then 
test[k\ - 1 
else test[k] <— 0 
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The signed integer sourcel operand is divided by the signed integer sourceZ operand. The 
ceiling of the mathematical quotient is stored into the signed integer memory field dest. 

The overflow-flag and test-flag may be affected by these operations. If overflow occurs, then 
the destination field will contain as many of the low-order bits of the true result as will fit. 
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S-F-CEILING 

The floating-point source field values are converted to signed integer values and stored in 
the destination field. 


Formats CM:s-f-ceiling-2-2L dest, source , d/en, s, e 


Operands dest 

The signed integer destination field. 

source 

The floating-point source field. 

len 

The length of the dest field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 

s, e 

The significand and exponent lengths for the source field. The 
total length of an operand in this format is s -f e + 1. 


Overlap The fields dest and source must not overlap in any manner. 

Flags overflow-flag is set if the result cannot be represented in the dest field; other¬ 

wise it is cleared. 


Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] — 1 then 
dest[k\ 4 — \source[k]\ 

if (overflow occurred in processor k) then overflow-flag[k ] 1 

The source field, treated as a floating-point number, is rounded to the nearest integer in 
the direction of +oo. The result is stored into the dest field as a signed integer. 
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U-CEILING 

The ceiling of the quotient of two unsigned integer source values is placed in the destination 
field* Overflow is also computed. 


Formats CM:u-ceiling-3-3L dest , sourcel , sources , d/en, slenl , slenS 

Operands dest The unsigned integer quotient field. 

sourcel The unsigned integer dividend field. 
sourceS The unsigned integer divisor field. 

dlen For CM:u-ceiling-3-3L, the length of the dest field. This must be 

non-negative and no greater than CM: *maximunvmteger-length* 0 

slenl For CM:u-ceiling-3-3L, the length of the sourcel field. This must be 
non-negative and no greater than CM:*maximum-integer-Iength*. 

slenS For CM: u-ceiling-3-3L, the length of the sourceS field. This must be 
non-negative and no greater than CM:*maximum-integer-length*. 

Overlap The fields sourcel and sourceS may overlap in any manner. Each of them, 
however, must be either disjoint from or identical to the dest field. Two integer 
fields are identical if they have the same address and the same length. It is 
permissible for all the fields to be identical. 

Flags overflow-flag is set if the quotient cannot be represented in the destination 

field; otherwise it is cleared. 

test-flag is set if the divisor is zero; otherwise it is cleared. 

Context This operation is conditional. The destination and flags may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 

if context-flag[k] ~ 1 then 

j i \ sourcel[k] 

dest\k\ 4— -~-f~4 

L J sourceS[k\ 

if (overflow occurred in processor k) then overflow-flag[k] 
else overflow-flacfk ] 4— 0 
if sourceS[k] = 0 then 
test[k] 4— 1 
else test[k] 0 
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The unsigned integer sourcel operand is divided by the unsigned integer source2 operand. 
The ceiling of the mathematical quotient is stored into the unsigned integer memory field 
dest 

The overflow-flag and test-flag may be affected by these operations. If overflow occurs, then 
the destination field will contain as many of the low-order bits of the true result as will fit. 
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U-F-CEILING 

The floating-point source field values are converted to unsigned integer values and stored 
in the destination field. 


Formats CM:u-f-ceiling-2-2L dest, source , dlen , s, e 


Operands dest 


The unsigned integer destination field. 


source The floating-point source field. 


len The length of the dest field. This must be non-negative and no 

greater than CM:*maximum-integer-iength*. 

s, e The significand and exponent lengths for the source field. The 

total length of an operand in this format is s + e + 1. 


Overlap The fields dest and source must not overlap in any manner. 

Flags overflow-flag is set if the result cannot be represented in the dest field; other¬ 

wise it is cleared. 


Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
dest \source] 

if (overflow occurred in processor k) then overflow-flag[k] 1 


The source field, treated as a floating-point number, is rounded to the nearest integer in 
the direction of +oo, which is stored into the dest field as an unsigned integer. 
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CLEAR-ALL-FLAGS 

Clears all flags (but not the context bit). 

Formats CM: clear-all-flags 

CM:clear-all-flags-always 

Context The non-always operations are conditional. 

The always operations are unconditional. 

Definition For every virtual processor k in the current-vp-set do 
if (always or context-flag[k] = 1) then 
test-flag[k] <— 0 
overflow-fla(fk\ <— 0 

Within each processor, all flags for that processor are cleared (but not the context bit). 
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most recent such operation was CM: cold-boot, then the same virtual processor configuration 
set up then will be used this time. If the most recent such operation was CM:attach, then 
the number of virtual processors will be equal to the number of physical processors, and 
the virtual news grid will have the same shape as the physical news grid. 

Bootstrapping a Connection Machine system includes the following actions: 

• Evaluating all initialization forms stored in the variable CM:♦before-cold-boot- 
initializations*. This is done before anything else. 

• Loading microcode into the Connection Machine microcontroller and initiating mi¬ 
crocontroller execution. 

• Clearing and initializing the memory of allocated Connection Machine processors. 

• Initializing all of the global configuration variables described in section 3.6. 

• Initializing the pseudo-random number generator by effectively invoking the operation 
CM:initialize-random-number-generator with no seed. 

• Initializing the system lights-display mode by effectively invoking the operation 
CM:set-system-leds-mode with an argument of t. 

• Evaluating all initialization forms stored in the variable CM:♦after-cold-boot- 
initializations*. This is done after everything else. 

If the cold-booting operation fails, then an error is signalled. If it succeeds, then three 
values axe returned: the number of virtual processors, the number of physical processors, 
and the number of bits available for the user in each virtual processor. (These are exactly 
the values of the configuration variables CM:*user-cube-address-limit*, CM: ♦physical-cube- 
address-limit*, and CM:*user-memory-address-limit*. 

In the C/Paris and Fortran/Paris interfaces, the cold-booting operation is performed by a 
user command cmcoldboot at shell level. See the Front End Subsystems manual. 
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F-COMPARE 

Compares two floating-point source values and stores into the signed integer destination 
field the result -1, 0, or 1 depending on whether the first source value is less than, equal 
to, or greater than the second source value. 


Formats CM:f-compare-3-2L dest, sourcel, sourceZ , dlen, s, e 


Operands dest 

sourcel 

sourceZ 

dlen 

s, e 


The signed integer destination field. 

The floating-point first source field. 

The floating-point second source field. 

The length of the dest field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 

The significand and exponent lengths for the sourcel and source2 
fields. The total length of an operand in this format is s + e + 1. 


Overlap The fields dest and sourcel must not overlap in any manner. The fields dest 
and source2 must not overlap in any manner. The fields sourcel and source2 
may overlap in any manner. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-up-set do 
if context-flag[k\ = 1 then 

if sourcel[k\ < source2[k] then 
dest[k] i -1 

else if sourcel[k\ > source2[k] then 
de$t[k\ <r— 1 
else 

dest[k] 0 


Two operands are compared as floating-point numbers. The destination receives the signed 
integer value -1, 0, or 1 depending on whether the first source value is less than, equal to, 
or greater than the second source value. 
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COMPARE 


S-COMPARE 

Compares two signed integer source values and stores into the signed integer destination 
field the result -1, 0, or 1 depending on whether the first source value is less than, equal 
to, or greater than the second source value. 


Formats 


CM:s-compare-3-3L dest, sourcel , source2 , dZen, slenl , slen2 


Operands dest 

sourcel 

source2 

dlen 

slenl 

slen2 


The signed integer destination field. 

The signed integer first source field. 

The signed integer second source field. 

The length of the dest field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 

The length of the sourcel field. This must be no smaller than 2 
but no greater than CM:*maximum-integer-length*. 

The length of the source2 field. This must be no smaller than 2 
but no greater than CM:*maximum-integer-length*. 


Overlap The fields dest and sourcel must not overlap in any manner. The fields dest 
and source2 must not overlap in any manner. The fields sourcel and source2 
may overlap in any manner. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 

if sourcel[k] < source2[k ] then 
dest[k] <-1 

else if sourcel [k] > source2[k] then 
dest[k] 1 
else 

dest[k] «- 0 


Two operands are compared as signed integers. The destination receives the value -1, 0, 
or 1 depending on whether the first source value is less than, equal to, or greater than the 
second source value. 
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U-COMPARE 

Compares two unsigned integer source values and stores into the signed integer destination 
field the result -1, 0, or 1 depending on whether the first source value is less than, equal 
to, or greater than the second source value. 


Formats CM:u-compare-3-3L dest, sourcel , source2 , dlen, slenl , slen2 


Operands dest 

sourcel 

source2 

dlen 

slenl 

slen2 


The signed integer destination field. 

The unsigned integer first source field. 

The unsigned integer second source field. 

The length of the dest field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 

The length of the sourcel field. This must be non-negative and no 
greater than CM:*maximum-integer-length*. 

The length of the source2 field. This must be non-negative and no 
greater than CM:*maximum-integer-!ength*. 


Overlap The fields dest and sourcel must not overlap in any manner. The fields dest 
and source2 must not overlap in any manner. The fields sourcel and source2 
may overlap in any manner. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

if sourcel[k] < source2[k ] then 
dest[k] <— —1 

else if sourcel [k] > source2[k ] then 
dest[k] <— 1 
else 

dest[k] 0 


Two operands are compared as unsigned integers. The destination receives the signed 
integer value -1, 0, or 1 depending on whether the first source value is less than, equal to, 
or greater than the second source value. 
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F-COS 

Calculates, in each selected processor, the cosine of the floating-point source field value and 
stores it in the floating-point destination field. 

Formats CM:f-cos-l-lL dest/source, s , e 
CM:f-cos-2-lL dest, source, s, e 

Operands dest The floating-point destination field. 

source The floating-point source field. 

s, e The significand and exponent lengths for the dest and source fields. 

The total length of an operand in this format is s + e + 1. 

Overlap The source field must be either disjoint from or identical to the dest field. 

Two floating-point fields are identical if they have the same address and the 
same format. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 

Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 
dest[k] +— cos source[k] 

The cosine of the value of the source field is stored into the dest field. 
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CREATE-DETAILED-GEOMETRY 

typedef enum {CM.news.order, CM.send.order } CM.axis.order.t; 

typedef struct CM.axis.descriptor { 
unsigned length; 
unsigned weight; 

CM.axis.order.t ordering; 
unsigned char on.chip.bits; 
unsigned char off.chip.bits; 

} * CM.axis.descriptor.t; 


Actually, this structure has other components as well. C code should use the definition of 
CM.axis.descriptor from the cmtypes.h include file. 

The Fortran/Paris interface defines CM«axis_descriptor as an array: 


INTEGER RANK,DESCRIPTOR.ARRAY(7,RANK) 


The elements of each Fortran axis descriptor are defined such that: 

DESC RJPTORJlRRAY(l,I) is the length of axis I 
DESCRIPTOR-ARRAY (2, 1) is the weight of axis I 
DESCRIPTOR-ARRAY(3,I) is the ordering of axis I 
DESC RIPTOR-ARRAY(4,I) is the on-chip bits of axis I 
DESCRIPTOR-ARRAY(6,I) is the off-chip bits of axis I 


Thus CM:axis-descriptor-array is, in Fortran, an array of axis descriptor arrays. 

The Lisp definitions of the type of the ordering component and of the axis descriptor are 
shown below. 

(deftype cm:axis-order () 9 (member -.news-order :send-order)) 

(defstruct CM:axis-descriptor 

(length 0) (weight 0) (ordering :news-order) 

(on-chip-bits 0) (off-chip-bits 0)) 

The axis-descriptor-array operand must be created by first making one axis descriptor for 
each axis and then using these to assign values to the array elements. An example in C is 
given below. Notice that axisl and axis2 are pointers to axis descriptor structures and that 
the descriptor structures are zeroed before any values are assigned. 


CM.geometry.id.t my.geometry; 
CM.axis.descriptor.t my.geometry.axes[2]; 
CM.axis.descriptor.t axis!, axis2; 
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GREATE-DETAILED-GEOMETRY 




axisl = (cm_axis_descriptor_t)malloc(sizeof(struct CM.axis.descriptor)) ; 

axis2 = (cm_axis_descriptor_t)malloc(sizeof(struct CM.axis.descriptor)); 

bzero(axisl, sizeof(struct CM.axis.descriptor)); 

bzero(axis2, sizeof(struct CM^axis.descriptor)); 

axisi->length * 128; 

axis2->Xength = 256; 

axisl->weight = 5; 

axis2->weight = 10; 

axisl->ordering = CM.news.order; 

axis2->ordering = CM.news.order; 

my.geometry.axes[0] = axisl; 
my .geometry., axes [1] = axis2; 

my .-geometry = CM.create.detailed.geometry(my.geometry.axes, 2); 

The following example specifies the same axes, descriptor array, and geometry in Lisp. 
Notice that the constructor CM: make-axis-descriptor is used. 

(setq my-geometry-axes make-array(2)) 

(setq axisl 

(CM:make-axis-descriptor :length 128 :weight 5 
sordering :news-order)) 

(setq axis2 

(CM:make-axis-descriptor :length 256 :weight 10 
:ordering :news-order))) 

(setf (aref my-geometry-axes 0) axisl) 

(setf (aref my-geometry-axis 1) axis2) 

(setq my-geometry (CM:make-detailed-geometry my-geometry-axes 2) 


Once the geometry has been created, the user may destroy the descriptors and the array 
used to provide axis information. All necessary information is copied out of these structures 
as the geometry is created. 

The “length” component of an axis descriptor specifies the length of the axis; it must be a 
power of two. 

The “weight” component of the axis descriptors specifies the relative frequency of inter- 
processor communication along different axes. For instance, in the above example it is 
assumed that communication occurs about half as often along axisl , which is given a weight 
of 5, as along axis2 , which is given a weight of 10. Only the relative values of the weight 
components matter. The same communication traffic could be specified with weights of 
1 and 2, or of 3 and 6. If all weights are 1, it is assumed that all axes are used equally 
frequently. 
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Given a set of weight components, Paris lays out the hypercube grid for optimal per¬ 
formance. Virtual processors are mapped onto the physical hypercube in a pattern that 
exploits the fact that communication is especially rapid among virtual processors within 
the same physical processor and among virtual processors within the same physical chip. 

The “ordering” component of an axis descriptor specifies how news coordinates are mapped 
onto physical processors for that axis. The value : news-order specifies the usual embedding 
of the grid into the hypercube such that processors with adjacent NEWS coordinates are in 
fact neighbors within the hypercube. The value :send-order specifies that, if processor A has 
a smaller NEWS coordinate than processor B, then A also has a smaller send-address than B. 
This ordering is rarely used. However, :send-order ordering is useful for specific applications 
such as FFT. The value : framebuffer-order is provided solely for creating VP sets that are 
used as image buffers (for details, see chapter 1 of the Generic Display Interface Reference 
Manual ). 

If the “weight” components are all 1, then the mapping of virtual to physical processors 
can be specified with the “on-chip-bits” and “off-chip-bits” components of the axis descrip¬ 
tors. This is not recommended. To tune performance for communication, use the weight 
component. 
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CREATE-GEOMETRY 


CREATE-GEOMETRY 


Creates a new geometry given the grid axis lengths. See also CM:intern-geometry. 


Formats result <— CM: create-geometry dimension-array, [rank] 

Operands dimension-array A front-end vector of unsigned integer lengths of the 

grid axes. In the Lisp interface, this may be a list of dimension 
lengths instead of an array of dimension lengths, at the user’s 
option. 

rank An unsigned integer, the rank (number of dimensions) of the 

dimension-array . This must be inbetween 1 and CM:♦max- 

geometry-rank*, inclusive. This argument is not provided when 
calling Paris from Lisp. 

Result A geometry-id, identifying the newly created geometry. 


Context This operation is unconditional. It does not depend on the context-flag . 


The dimension-array must be a one-dimensional array of nonnegative integers; each must 
be a power of two. The product of all these integers must be a multiple of the number of 
physical processors attached for use by this process. 

This operation returns a geometry-id for a newly created geometry whose dimensions are 
specified by the dimension-array. The length of axis j of the resulting geometry will be 
equal to dimension-array[j]. Such a geometry-id may then be used to create a VP set, or 
to respecify the geometry of an existing VP set. 

The geometry will be laid out so as to optimize performance under the assumption that 
the axes are used equally frequently for NEWS communication. The operation CM:create- 
detailed-geometry may be used instead to get more precise control over layout for perfor¬ 
mance tuning. 

Once the geometry has been created, the user may destroy the array used to provide the 
dimension information. All necessary information is copied out of this array as the geometry 
is created. 
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DEPOSIT-NEWS-COORDINATE 


DEPOSIT-NEWS-COORDINATE 


Modifies a send address to reflect a specific NEWS coordinate. 


Formats CM:deposit-news-coordinate-lL geometry , dest/send-address, 

axis, coordinate, slen 

C M: d ep osi t- n e ws- constant-lL geometry, dest / send- address, 

axis, coordinate-value , sZen 

Operands geometry A geometry-id. This geometry determines the news dimensions 

to be used. 

dest The unsigned integer destination field. (In the instruction for¬ 

mats currently provided, the dest field is always the same as the 
send-address source field. The length of this field is implicitly the 
same as geometry-send-address-length(geometry).) 

send-address The unsigned integer send-address field. 

axis An unsigned integer immediate operand to be used as the number 

of a news axis. 

coordinate The unsigned integer NEWS coordinate field, field. This 

specifies the position along the corrsponding axis of the proces¬ 
sor whose send address is to be calculated. 

coordinate-value An unsigned integer immediate operand to be used as 
the NEWS coordinate along the specified axis. 

slen The length of the coordinate field. This must be non-negative and 

no greater than CM:*maximum-integer-length*. 

Overlap For CM:deposit-news-coordinate-lL, the coordinate field must not overlap the 

dest field. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

dest[k ] <— deposit-news-coordinate(geometry, send-address, axis, coordinate) 
where deposit-news-coordinate is as defined on page 33. 

This function calculates, within each selected processor, the send-address of a processor 
that has a specified coordinate along a specified NEWS axis, with all other coordinates equal 
to those for the processor identified by send-address. 
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DEPOSIT-NEWS-COORDINATE 


FE-DEPOS IT-NEWS-COORDINATE 

Calculates on the front end the modification of a send address to reflect a specific NEWS 
coordinate. 

Formats result <— CM:fe-deposit-news-coordinate geometry , send-address, 

axis , coordinate 

Operands geometry A geometry-id. This geometry determines the NEWS dimensions 

to be used. 

send-address An unsigned integer immediate operand to be used as the 

send address of some processor. 

axis An unsigned integer immediate operand to be used as the number 

of a news axis. 

coordinate An unsigned integer immediate operand to be used as the 

NEWS coordinate along the specified axis. 

Result An unsigned integer, the send address of the processor whose coordinate along 
the specified axis is coordinate and whose coordinate along all other axes 
equals those of send-address. 

Context This operation is unconditional. It does not depend on the context-flag . 


Definition Return deposit-news-coordinate(geometry , send-address , axis , coordinate) 
where deposit-news-coordinate is as defined on page 33. 

This function calculates, entirely on the front end, the send-address of a processor that has 
a specified coordinate along a specified NEWS axis, with all other coordinates equal to those 
for the processor identified by send-address . 
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EXP 


F-EXP 

Calculates, in each selected processor, the exponential function e x of the floating-point 
source field and stores it in the floating-point destination field. 


Formats CM:f-exp-l-lL dest/source , s, e 
CM:f-exp-2-lL dest , source , s, e 


Operands dest 

source 
s, e 


The floating-point destination field. 

The floating-point source field. 

The significand and exponent lengths for the dest and source fields. 
The total length of an operand in this format is s + e + 1. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two floating-point fields are identical if they have the same address and the 
same format. 


Flags overflow-flag is set if floating-point overflow occurs; otherwise it is unaffected. 

Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
if source[k\ = +oo then 
dest\k\ <— -f-oo 
else if source[k] = — oo then 
dest[k] 4—1-0 
else 

dest[k] 4— exp source[k} 

if (overflow occurred in processor k) then overflow-flag[k ] 4— 1 


Call the value of the source field s; the value e s is stored into the dest field, where e « 
2.718281828 ... is the base of the natural logarithms. 
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EXTRACT-MULTI-COORDINATE 


Determines the news multi-coordinate of a processor specified by send-address. 


Formats CM:extract-multi-coordinate-lL geometry , dest , axis-mask , send-address , d/en 

Operands geometry A geometry-id. This geometry determines the NEWS dimensions 

to be used. 

dest The unsigned integer destination field. 

axis-mask An unsigned integer, the mask indicating a set of NEWS axes. 

send-address The send-address field. For each processor, this identifies the 
send-address of some other processor. 

dlen The length of the dest field. This must be non-negative and no 

greater than CM: *maximum-integer-Iength*. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let axis-set = {m|0<m<rA (axis-mask (m) = 1) } 

dest[k] extract-multi-coordinate(geometry, axis-set, send-address) 

where extract-multi-coordinate is as defined on page 34. 


This function calculates, within each selected processor, the news multi-coordinate of a 
processor along specified NEWS axes. The axes are indicated by the axis-mask argument; 
the processor is identified by its send-address. 
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FE-EXTRACT-MULTI-COORDINATE 

Calculates, on the front end, the NEWS multi- coordinate of a processor specified by send- 
address. 


Formats result CM:fe-extract-multbcoordinate geometry , axis-mask, send-address 

Operands geometry A geometry-id. This geometry determines the news dimensions 

to be used. 

axis-mask An unsigned integer, the mask indicating a set of news axes. 

send-address An unsigned integer immediate operand to be used as the 
send address of some processor. 


Result An unsigned integer, the NEWS multi-coordinate of the specified processor 
along the specified axes. 

Context This operation is unconditional. It does not depend on the context-flag. 


Definition Let axis-set = {m \ 0 < m < r A (axis-mask(m) = 1) } 

Return extract-multi-coordinate(geometry, axis-set , send-address) 

where extract-multi-coordinate is as defined on page 34. 


This function calculates, entirely on the front end, the NEWS multi-coordinate of a processor 
along specified NEWS axes. The axes are indicated by the axis-mask argument; the processor 
is identified by its send-address. 
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EXTRACT-NEWS-COORDINATE 


Determines the news coordinate of a processor specified by send-address. 


Formats CM:extract-news-coordinate-lL geometry , dest , axis , send-address, dlen 


Operands geometry A geometry-id. This geometry determines the news dimensions 

to be used. 

dest The unsigned integer destination field. 

axis An unsigned integer immediate operand to be used as the number 

of a news axis. 

send-address The send-address field. For each processor, this identifies the 
send-address of some other processor. 

dlen The length of the dest field. This must be non-negative and no 

greater than CM:*maximum-integer-length*. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 

dest[k] 4 — extract-news-coordinate(geometry, axis, send-address) 
where extract-news-coordinate is as defined on page 33. 


This function calculates, within each selected processor, the NEWS coordinate of a processor 
along a specified news axis. The axis is indicated by the axis argument; the processor is 
identified by its send-address. 
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FLOOR 


S-FLOOR 

The floor of the quotient of two signed integer source values is placed in the destination 
field. Overflow is also computed. 


Formats CM:s-floor-3-3L dest , sourcel , sourceZ , dZen, slenl , slen2 


Operands dest 

sourcel 

sourceZ 

dlen 

slenl 

slen2 


The signed integer quotient field. 

The signed integer dividend field. 

The signed integer divisor field. 

For CM:s-floor-3-3L, the length of the dest field. This must be no 
smaller than 2 but no greater than CM: *maximum-integer-length*. 

For CM:s-floor-3-3L, the length of the sourcel field. This must 
be no smaller than 2 but no greater than CM: *maximum-integer- 
length*. 

For CM:s-floor-3-3L, the length of the source2 field. This must 
be no smaller than 2 but no greater than CM:♦maximum-integer- 
length*. 


Overlap The fields sourcel and source2 may overlap in any manner. Each of them, 
however, must be either disjoint from or identical to the dest field. Two integer 
fields are identical if they have the same address and the same length. It is 
permissible for all the fields to be identical. 


Flags overflow-flag is set if the quotient cannot be represented in the destination 

field; otherwise it is cleared. 

test-flag is set if the divisor is zero; otherwise it is cleared. 

Context This operation is conditional. The destination and flags may be altered only 
in processors whose context-flag is 1. 


Definition 


For every virtual processor k in the current-vp-set do 
if context-flag[k ] = 1 then 

dest[k] « 


sourcel 

k 

source2 

T 


if (overflow occurred in processor k) then overflow-flag[k] 
else overflow-flacfk ] <— 0 
if source2 [&] = 0 then 
test[k\ <— 1 
else test[k] <— 0 
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FLOOR 


The signed integer sourcel operand is divided by the signed integer sourceB operand. The 
floor of the mathematical quotient is stored into the signed integer memory field dest. 

The overflow-flag and test-flag may be affected by these operations. If overflow occurs, then 
the destination field will contain as many of the low-order bits of the true result as will fit. 
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S-F-FLOOR 

Calculates, in each selected processsor, the largest integer that is not greater than a specified 
floating-point value and stores the result as a signed integer field. 


Formats CM:s-f-floor-2-2L dest, source , dlen, s, e 


Operands dest 

source 

len 

s, e 


The signed integer destination field. 

The floating-point source field. 

The length of the dest field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 

The significand and exponent lengths for the source field. The 
total length of an operand in this format is s + e + 1. 


Overlap The fields dest and source must not overlap in any manner. 

Flags overflow-flag is set if the result cannot be represented in the dest field; other¬ 

wise it is cleared. 


Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 
dest[k] <— [source[k)\ 

if (overflow occurred in processor k) then overflow-flag[k] 1 
else overflow-fla^k ] 0 


The source field, treated as a floating-point number, is rounded to the nearest integer in 
the direction of — 00 , which is stored into the dest field as a signed integer. 
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U-FLOOR 

The floor of the quotient of two unsigned integer source values is placed in the destination 
field. Overflow is also computed. 


Formats CM:u-floor-3-3L dest , sourcel , source 2, dlen , slenl , s/en2 

Operands dest The unsigned integer quotient field. 

sourcel The unsigned integer dividend field. 
source2 The unsigned integer divisor field. 

dlen For CM:s-floor-3-3L, the length of the dest field. This must be 

non-negative and no greater than CM:*maximum-integer-length*. 

slenl For CM:s-floor-3-3L, the length of the sourcel field. This must be 
non-negative and no greater than CM:*maximum-intGger-length*. 

slenS For CM:s-fIoor-3-3L, the length of the sourceS field. This must be 
non-negative and no greater than CM:*maximum-integer-length*. 

Overlap The fields sourcel and sourceS may overlap in any manner. Each of them, 
however, must be either disjoint from or identical to the dest field. Two integer 
fields are identical if they have the same address and the same length. It is 
permissible for all the fields to be identical. 

Flags overflow-flag is set if the quotient cannot be represented in the destination 

field; otherwise it is cleared. 

test-flag is set if the divisor is zero; otherwise it is cleared. 

Context This operation is conditional. The destination and flags may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 

if context-flag[k] = 1 then 

j i sourcel \k] 

dest\k\ —-— trfyH- 

L J isource2[k\_ 

if (overflow occurred in processor k) then overflow-flag[k\ 
else overflow-fla^k] 0 

if sourceZ [k] = 0 then 
test[k] <r- 1 
else test[k] 0 
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FLOOR 


The unsigned integer sourcel operand is divided by the unsigned integer source2 operand. 
The floor of the mathematical quotient is stored into the unsigned integer memory field 
dest 

The overflow-flag and test-flag may be affected by these operations. If overflow occurs, then 
the destination field will contain as many of the low-order bits of the true result as will fit. 
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U-F-FLOOR 

Converts floating-point source field values into unsigned integers by rounding towards —oo. 


Formats CM:u-f-floor-2-2L dest, source , dlen , s, e 


Operands dest 

The unsigned integer destination field. 

source 

The floating-point source field. 

len 

The length of the dest field. This must be non-negative and no 
greater than CM:*maximum-integer-length*. 

s, e 

The significand and exponent lengths for the source field. The 
total length of an operand in this format is s + e + 1. 


Overlap The fields dest and source must not overlap in any manner. 


Flags overflow-flag is set if the result cannot be represented in the dest field; other¬ 

wise it is cleared. 

Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
dest <— [ source J 

if (overflow occurred in processor k) then overflow-flag[k ] 1 

The source field, treated as a floating-point number, is rounded to the nearest integer in 
the direction of —oo. The result is stored into the dest field as an unsigned integer. 
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FE-FROM-GRAY-CODE 


Calculates, on the front end, the Gray code representation of a specified integer. 


Formats 

Operands 

Result 

Context 


result <— CM:fe-from-gray-code code 

code An unsigned integer immediate operand to be used as the Gray 

encoding, represented as a nonnegative integer. 

An unsigned integer, the nonnegative integer represented by code. 

This operation is unconditioned. It does not depend on the context-flag. 


Definition Let n = integer-length(code) 

Retum [sa&J 

This function calculates, entirely on the front end, the integer represented by a bit-string 
encoding code in a particular reflected binary Gray code. 

Note that the binary value 0 is always equivalent to a Gray code string that is all 0-bits. 
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GEOMETRY-SEN D-ADDRESS-LENGTH 


Returns the number of bits needed to represent a send-address. 


Formats 

Operands 

Result 

Context 


result <— CM:geometry-send-address-length geometry-id 
geometry-id A geometry-id. 

An unsigned integer, the number of bits required to represent a send-address 
for a processor in the specified geometry. 

This operation is unconditional. It does not depend on the context-flag . 


Definition Let n = rank(geometry-id) 

n—1 

Return X) integer-length(axis-descriptors(geometry-id)[j].length — 1) 
j=o 


This operation returns the number of bits required to represent a send-address for a virtual 
processor in any VP set whose geometry is the one specified by the geometry-id. This will 
be equal to the sum of the numbers of bits needed to represent NEWS coordinates for all 
the axes. 
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GEOMETRY-SERIAL-NUMBER 


Assigns a unique number to the specified geometry. 


Formats 

Operands 

Result 

Context 


result <— CM:geometry-serial-number geometry-id 

geometry-id A geometry-id. This geometry-id must be obtained by calling 
CM:create-geometry or CM: create-detailed-geometry. 

The serial number that uniquely identifies the geometry. 

This operation is unconditional. It does not depend on the context-flag. 


A unique number, the serial number, is assigned to the specified geometry. This facilitates 
geometry-based caching; geometry serial numbers are useful as hash table keys. 

Note that geometry-id’s are not unique identifiers. After a geometry is deallocated, its id 
may be reused for another geometry. In contrast, geometry serial numbers are guaranteed 
to be unique. 


157a 











GEOMETRY-TOTAL-PROCESSORS 


GEOMETRY-TOTAL-PROCESSORS 


Returns the number of virtual processors for a geometry. 


Formats 

Operands 

Result 

Context 


result «— CM: geometry-total-processors geometry-id 
geometry-id A geometry-id. 

An unsigned integer, the total number of processors in the specified geometry. 
This operation is unconditional. It does not depend on the context-flag. 


Definition Let n = rank(geometry-id) 

n—1 

Return n axis-descriptors(geometry-id )[j ].length 
j =o 

This operation returns the total number of virtual processors in any VP set whose geometry 
is the one specified by the geometry-id. This will be equal to the product of the lengths of 
all the axes. 
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GEOMETRY-TOTAL-VP-RATIO 




GEOMETRY-TOTAL-VP-RATIO 


Returns the total YP ratio for a specified geometry. 


Formats 

Operands 

Result 

Context 


result «— CM:geometry-total-vp-ratio geometry-id. 
geometry-id A geometry-id. 

An unsigned integer, the number of virtual processors represented within each 
physical processor for the specified geometry. 

This operation is unconditional. It does not depend on the context-flag. 


Definition Let n = rank(geometry-id) 

n -1 

Return n axis-descriptor ( geometry-id) [j]. vp-ratio 
j=o 

This operation returns the total VP ratio for a specified geometry. This is equal to the 
total number of virtual processors for the geometry, divided by the total number of physical 
processors. 
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GET 


GET 

Each selected processor gets a message from a specified source processor, possibly itself. A 
source processor may supply messages even if it is not selected. Messages are all retrieved 
from the same memory address within each source processor, and all the source processors 
may be in a VP set different from the VP set of the destination processors. 


Formats CM:get-lL dest> send-address , source , len 

Operands dest The destination field. 

send-address The send-address field. For each processor, this indicates 
from which processor a message is retrieved. 

source The source field. 

len The length of the dest and source fields. 


Overlap The send-address and source may overlap in any manner. The dest field may 
overlap with the send-address or source but, if it does, then it is forbidden to 
send a message to a selected processor. In other words, the dest may overlap 
with the send-address or source only if at most one of them will be used within 
each processor. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k ] = 1 then 

dest[k] source[send-address[k ]] 


For every selected processor p^, a message length bits long is sent to pd from the processor p s 
whose send-address is in the field send-address in the memory of processor pd . The message 
is taken from the source field within processor p 8 and is stored into the field at location 
dest within processor pd• Although the send-address operand is a field in the VP set of the 
destination processors, its value must specify a valid send address for source , which may 
belong to a different VP set. 

Note that more than one selected processor may request data from the same source processor 
p 5 , in which case the same data is sent to each of the requesting processors. 
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GET-AREF32 


GET-AREF32 

Each selected processor gets a message from a specified array field within any specified 
source processor (possibly itself). A source processor may supply messages even if it is 
not selected. Messages are all retrieved from the same memory address within each source 
processor. 

Formats CM:get-aref32-2L dest, send-address, array , index , dtera, index-len, index-limit 

Operands dest The destination field. 

send-address The send-address field. For each processor, this indicates 
from which processor a message is retrieved. 

array The source array field. This must be stored in the special format 
required by CM:aref32. 

index The unsigned integer index into the array field. This is used as a 
per-processor index into array . It specifies portions of the array 
memory area in increments of dlen . 

dlen The length of the dest field. 

index-len The length of the index field. This must be non-negative and no 
greater than CM:♦maximum-integer-length 5 * 6 . 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index . This is taken as the extent 
of array. 

Overlap The send-address and array may overlap in any manner. The dest field may 
overlap with the send-address or array but, if it does, then it is forbidden to 
send a message to a selected processor. In other words, the dest may overlap 
with the send-address or array only if at most one of them will be used within 
each processor. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag \$ 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

if index[k] < index-limit then 

let r = geometry-total-vp-ratio(geometry(current-vp-set)) 
let m = |^J mod 32 
let i = index[k] 
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GET-AREF32 


for all j such that 0 < j < dlen do 

let q = send-address[k ] — m x r + (j mod 32) x r 
let b = i+ [^J 
dest[k](j) <— array[q](b) 


else 

(error) 


For every selected processor pd, a message length bits long is sent to pd from the processor p 8 
whose send-address is in the field send-address in the memory of processor pd . The message 
is taken from the array field within processor p 8 as if by the operation aref32 and is stored 
into the field at location dest within processor pd» 

Note that more than one selected processor may request data from the same source processor 
p,, possibly from different locations within the array. Note also that in each case the array 
element to be sent from processor p 8 to processor pd is determined by the value of index 
within pd, not the value within p 8 . 
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GLOBAL-MAX 


GLOBAL-U-MAX 

One unsigned integer is examined in every selected processor, and the largest of all these 
integers is returned to the front end as an unsigned integer. 


Formats result 

Operands source 
len 


CM:global-u-max-lL source , len 
The unsigned integer source field. 

The length of the source field. This must be non-negative and no 
greater than CM:*maximum-integer-Iength*. 


Result An unsigned integer, the largest of the source fields. 


Overlap There are no constraints, because overlap is not possible. 


Flags test-flag is set if the value in a particular processor equals the maximum; 

otherwise it is cleared. 


Context This operation is conditional. The result returned depends only upon proces 
sors whose context-flag is 1. 


Definition Let S = {m \ m E current-vp-set A context-flag[m] = 1} 
If |5| = 0 then 

return 2 len — 1 to front end 
else 

let R = ( ma xsource[m] 

\m€S 

For every virtual processor k in the current-vp-set do 
if context- flag[k\ = 1 then 
if source[k] = R then 
test-flag[k\ 1 
else 

test-flag[k] *— 0 
return R to front end 


The CM:global-u-max operation returns the largest of the unsigned-integer source fields of 
all selected processors. This largest value is sent to the front-end computer as an unsigned 
integer and returned as the result of the operation. In addition, the test-flag is set in every 
selected processor whose field is equal to the finally computed value, and is cleared in all 
other selected processors. If there are no selected processors, then the value 0 is returned. 
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GLOBAL-MAX-INTLEN 




GLOBAL-U-MAX-S-INTLEN 

One signed integer is examined in every selected processor, and the largest length of all 
these integers is returned to the front end as an unsigned integer. 


Formats result 

Operands source 
len 


CM:global-u-max-s-intlen-lL source , len 
The signed integer source field. 

The length of the source field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 


Result 

Overlap 

Flags 

Context 


An unsigned integer, the length of the source field value of greatest length. 

There are no constraints, because overlap is not possible. 

test-flag is set if the value in a particular processor has a length equal to the 
maximum; otherwise it is cleared. 

This operation is conditional. The result returned depends only upon proces¬ 
sors whose context-flag is 1. 


Definition 


Let S={m|mG current-vp-set A context-flag[m] = 1 } 
If |S| = 0 then 

return — 2 ien ~ 1 to front end 
else 


let R 


= t 


max 1 

m£S 


[log 2 (l + 11 + source[m]|)]) 

For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
if source[k ] = R then 
test-flag[k\ 1 
else 

test-flag[k] <— 0 
return R to front end 


The CM:global-u-max-s-intlen operation computes the integer-length of each signed integer 
source value. The largest length is sent to the front-end computer as an unsigned integer 
and returned as the result of the operation. In addition, the test-flag is set in every selected 
processor whose field is equal to the finally computed value, and is cleared in all other 
selected processors. If there are no selected processors, then the value 0 is returned. 

A call to CM:global-u-max-s-intIen-lL is equivalent to the sequence 
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GLOBAL-MAX-INTLEN 


CM:u-integer-length-lL temp, source, len, len 
CM:global-u-max-lL temp, len 


but may be faster. 



GLOBAL-MIN 


GLOBAL-F-MIN 

One floating-point number is examined in every selected processor, and the smallest of all 
these integers (that is, the one closest to — oo) is returned to the front end as a floating-point 
number. 


Formats 

Operands 


Result 

Overlap 

Flags 


result CM:global-f-min-lL source , s, e 

source The floating-point source field. 

s, e The significand and exponent lengths for the source field. The 

total length of an operand in this format is s + e + 1. 

A floating-point number, the smallest of the source fields. 

There are no constraints, because overlap is not possible. 

test-flag is set if the value in a particular processor equals the minimum; 
otherwise it is cleared. 


Context This operation is conditional. The result returned depends only upon proces¬ 
sors whose context-flag is 1. 


Definition Let S = {m|mE current-vp-set A context-flag[m] = 1 } 
If \S\ = 0 then 

return +oo to front end 
else 

let R = ( min source[m ] 

\mes 

For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 
if source[k] = R then 
test-flag[k] <— 1 
else 

test-flag[k] 0 
return R to front end 


The CM: global-f-min operation returns the smallest (that is, closest to —oo) of the floating¬ 
point source fields of all selected processors. This smallest value is sent to the front-end 
computer as a floating-point number and returned as the result of the operation. In addition, 
the test-flag is set in every selected processor whose field is equal to the finally computed 
value, and is cleared in all other selected processors. If there are no selected processors, 
then the value +oo is returned. 
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GLOBAL-MIN 


GLOBAL-S-MIN 

One signed integer is examined in every selected processor, and the smallest of all these 
integers (that is, the one closest to — oo) is returned to the front end as a signed integer. 


Formats result 

Operands source 
len 


CM:global-s-min-lL source , len 
The signed integer source field. 

The length of the source field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 


Result A signed integer, the smallest of the source fields. 


Overlap There are no constraints, because overlap is not possible. 


Flags 


test-flag is set if the value in a particular processor equals the minimum; 
otherwise it is cleared. 


Context This operation is conditional. The result returned depends only upon proces¬ 
sors whose context-flag is 1. 


Definition Let S = {m | m E current-vp-set A context-flag[m\ = 1} 
If |5| = 0 then 

return —2 /cn ” 1 to front end 
else 

let R = ( min source[m] ) to front end 

For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
if source[k] = R then 
test-flag[k\ <— 1 
else 

test-flag[k] <— 0 
return R to front end 


The CM:global-s-min operation returns the smallest (that is, closest to —oo) of the signed- 
integer source fields of all selected processors. This smallest value is sent to the front-end 
computer as a signed integer and returned as the result of the operation. In addition, the 
test-flag is set in every selected processor whose field is equal to the finally computed value, 
and is cleared in all other selected processors. If there are no selected processors, then the 
value 2 len ~ 1 — 1 is returned. 


187 






GLOBAL-MIN 


GLOBAL-U-MIN 

One unsigned integer is examined in every selected processor, and the smallest of all these 
integers is returned to the front end as an unsigned integer. 

Formats result «— CM:global-u-min-lL source , len 

Operands source The unsigned integer source field. 

len The length of the source field. This must be non-negative and no 

greater than CM:*maximum-integer-length*. 

Result An unsigned integer, the smallest of the source fields. 

Overlap There are no constraints, because overlap is not possible. 

Flags test-flag is set if the value in a particular processor equals the minimum; 

otherwise it is cleared. 

Context This operation is conditional. The result returned depends only upon proces¬ 
sors whose context-flag is 1. 


Definition Let S = { m | m 6 current-vp-set A context-flag[m] = 1 } 

If |5| = 0 then 

return 0 to front end 
else 

let R = ( min source [ml 
\mes 1 J 

For every virtual processor k in the current-vp-set do 

if context-flag[k] = 1 then 

if source[k] — R then 

test-flag[k] 1 

else 

test-flag[k] 0 
return R to front end 

The CM: global-u-min operation returns the smallest (that is, closest to — oo) of the unsigned- 
integer source fields of all selected processors. This smallest value is sent to the front-end 
computer as an unsigned integer and returned as the result of the operation. In addition, 
the test-flag is set in every selected processor whose field is equal to the finally computed 
value, and is cleared in all other selected processors. If there are no selected processors, 
then the value 2 len — 1 is returned. 
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LOAD-CONTEXT 


LOAD-CONTEXT 

Unconditionally reads a bit from memory and loads it into the context bit. 

Formats CM: load-context source 

Operands source The source bit (a one-bit field). 

Context This operation is unconditional. 


Definition For every virtual processor k in the current-vp-set do 
context-flag[k\ source[k} 


Within each processor, a bit is read from memory and unconditionally loaded into the 
context bit for that processor. 
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LOAD-FLAG 


LOAD-flag 

Reads a bit from memory and loads it into a flag. 


Formats CM: load-test source 

CM:load-test-always source 
CM: load-overflow source 

CM: load-overflow-always source 


Operands source The source bit (a one-bit field). 


Context The non-always operations are conditional. 

The always operations are unconditional. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 
flag[k ] <— source[k\ 

where flag is test-flag or overflow-flag , as appropriate. 


Within each processor, a bit is read from memory and loaded into the indicated flag for 
that processor. 
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MULTISPREAD-ADD 


MULTISPREAD-F-ADD 

The destination field in every selected processor receives the sum of the floating-point source 
fields from all processors in the same hyperplane through the news grid. 


Formats CM:multispread-f-add-lL dest, source , axis-mask , s, e 


Operands 


dest 
source 
axis-mask 
s, e 


The floating-point destination field. 

The floating-point source field. 

An unsigned integer, the mask indicating a set of NEWS axes. 

The significand and exponent lengths for the dest and source fields. 
The total length of an operand in this format is s + e + 1. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two floating-point fields are identical if they have the same address and the 
same format. 


Context This operation is conditional. The destination may be altered only in proces 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g= geometry (current-vp-set) 
let r = rank(g) 

let axis-set = {m \ 0 < m < r A (axis-mask(m) = 1)} 
let Ck = { tn | m 6 hyperplane(g , k, axis-set) A context-flag[m ] = 1 } 

dest[k] I source[m\ 

\m£C k 

where hyperplane is as defined on page 36. 

See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-f-add operation combines source fields by performing floating-point addi¬ 
tion. 

A call to CM:multispread-f-add-lL is equivalent to the sequence 

CM:f-move-zero-always-lL temp , s, e 
CM:f-move-lL temp , source , s, e 
CM: store-context ctemp 
CM: set-context 
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MULTISPREAD-ADD 




for all integers j, 0 < j < rank(geometry(current-vp-set )), in any sequential order, do 
if axi$-mask{j) = 1 then 

CM; spread-with-f-add-lL temp , temp, s, e 
CM: load-context ctemp 
CM:f-move-lL dest, temjp, s, e 

but may be faster. 
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MULTISPREAD-ADD 


MULTISPREAD-S-ADD 

The destination field in every selected processor receives the sum of the signed integer source 
fields from all processors in the same hyperplane through the news grid. 


Formats CM:multispread-s-add-lL dest, source, axis-mask, len 


Operands 


dest 

source 

axis-mask 

len 


The signed integer destination field. 

The signed integer source field. 

An unsigned integer, the mask indicating a set of NEWS axes. 

The length of the dest and source fields. This must be no smaller 
than 2 but no greater than CM:*maximum-integer-length*. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g— geometry (current-vp-set) 
let r = rank(g) 

let axis-set = {m \ Q < m < r A (axis-mask(m) = 1)} 

let Ck = { m | m E hyperplane(g, k, axis-set ) A context-flag[m\ = 1 } 

dest[k] *— I source[m ] I 

\m€C* / 

where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-s-add operation combines source fields by performing signed integer addi¬ 
tion. 
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MULTISPREAD-ADD 


MULTISPREAD-U-ADD 

The destination field in every selected processor receives the sum of the unsigned integer 
source fields from all processors in the same hyperplane through the news grid. 

Formats CM:muItispread-u-add-lL dest, source, axis-mask, len 

Operands dest The unsigned integer destination field. 

source The unsigned integer source field. 

axis-mask An unsigned integer, the mask indicating a set of news axes. 

len The length of the dest and source fields. This must be non-negative 

and no greater than CM:*maximum-integer-length*. 

Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g= geometry [current-vp-set) 
let r = rank(g) 

let axis-set = {m | 0 < m< r A (axis-mask(m) = 1)} 
let Ck = { m | m £ hyperplane(g, k, axis-set) A context-flag[m] = 1} 

dest[k] <— I J2 source[m ] 

\m€C7fc 

where hyperplane is as defined on page 36. 

See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-u-add operation combines source fields by performing unsigned integer ad¬ 
dition. 
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MULTISPREAD-COPY 


MULTISPREAD-COPY 

The destination field in every selected processor receives a copy of the source value from a 
particular value within its scan subclass. 

Formats CM:multispread-copy-lL dest , source, axis-mask, len, multi-coordinate 

Operands dest The unsigned integer destination field. 

source The unsigned integer source field. 

axis-mask An unsigned integer, the mask indicating a set of news axes. 

len The length of the dest and source fields. This must be non-negative 

and no greater than CM:*maximum-integer-length*. 

multi-coordinate An unsigned integer, the multi-coordinate indicating 
which element of each hyperplane is to be replicated throughout 
that hyperplane. 

Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry (current-vp-set) 
let r — rank(g) 

let axis-set = {m \ 0 < m < r A (axis-mask(m) = 1)} 

let c = deposit-multi-coordinate^, k, axis-set, multi-coordinate) 

dest[k\ *— source[c] 

where deposit-multi-coordinate is as defined on page 34. 

See section 5.16 on page 34 for a general description of multispread operations. 
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MULTISPREAD-LOG AND 


MULTISPREAD-LOGAND 

The destination field in every selected processor receives the bitwise logical AND of the 
source fields from all processors in the same hyperplane through the NEWS grid. 


Formats CM:multispread-logand-lL dest, source , axis-mask, len 


Operands 


dest 

source 

axis-mask 

len 


The destination field. 

The source field. 

An unsigned integer, the mask indicating a set of news axes. 

The length of the dest and source fields. This must be non-negative 
and no greater than CM:*maximum-integer-Iength* 0 


Overlap 


The source field must be either disjoint from or identical to the dest field. Two 
bit fields are identical if they have the same address and the same length. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g= geometry {current-vp-set) 
let r = rank{g) 

let axis-set = {m\ 0 < m < r A {axis-mask{m) — 1)} 

let Ck = { m | m £ hyperplane{g , k, axis-set) A context-flag[m] = 1} 

dest[k] <— j /\ source[m] } 

\mec k J 

where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-logand operation combines source fields by performing bitwise logical and 
operations. 
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MULTISPREAD-LOGIOR 


MULTISPREAD-LOGIOR 

The destination field in every selected processor receives the bitwise logical inclusive or of 
the source fields from all processors in the same hyperplane through the news grid. 


Formats CM:multispread-logior-lL dest, source , axis-mask , len 


Operands 


dest 

source 

axis-mask 

len 


The destination field. 

The source field. 

An unsigned integer, the mask indicating a set of NEWS axes. 

The length of the dest and source fields. This must be non-negative 
and no greater than CM:*maximum-integer-length*. 


Overlap The source field must be either disjoint from or identical to the dest field. Two 
bit fields are identical if they have the same address and the same length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry(current-vp-set) 
let r = rank(g) 

let axis-set = {m \ 0 < m < r A (axis-mask(m) = 1)} 

let Ck = { rn | m E hyperplane(g , k, axis-set) A context-flag[m\ = 1 } 

dest[k\ j V source[m] j 
where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-logior operation combines source fields by performing bitwise logical inclu¬ 
sive OR operations. 
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MULTISPREAD-LOGXOR 


MULTISPREAD-LOGXOR 

The destination field in every selected processor receives the bitwise logical exclusive or of 
the source fields from all processors in the same hyperplane through the news grid* 


Formats CM:multispread-logxor-lL dest, source, axis-mask, len 


Operands 


dest 

source 

axis-mask 

len 


The destination field. 

The source field. 

An unsigned integer, the mask indicating a set of news axes. 

The length of the dest and source fields. This must be non-negative 
and no greater than CM:*maximum-integer-length*. 


Overlap The source field must be either disjoint from or identical to the dest field. Two 
bit fields are identical if they have the same address and the same length. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry(current-vp-set) 
let r = rank(g) 

let axis-set = {ra|0<ra<rA (axis-mask (m) — 1)} 

let Ck = { m | m G hyperplane(g, k, axis-set ) A context-flag[m] — 1} 

dest[k] <— I © source[m] J 
\meCk ) 

where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-logxor operation combines source fields by performing bitwise logical ex¬ 
clusive or operations. 
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MULTISPREAD-MAX 


MULTISPREAD-F-MAX 

The destination field in every selected processor receives the largest of the floating-point 
source fields from all processors in the same hyperplane through the news grid. 


Formats CM:multispread-f-max-lL dest , source , axis-mask , s, e 


Operands 


dest 
source 
axis-mask 
s, e 


The floating-point destination field. 

The floating-point source field. 

An unsigned integer, the mask indicating a set of news axes. 

The significand and exponent lengths for the dest and source fields. 
The total length of an operand in this format is s + e + 1. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two floating-point fields are identical if they have the same address and the 
same format. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry (current-up-set) 
let r = rank(g) 

let axis-set = {m| 0 < m< r A ( axis-mask(m) = 1) } 

let Ck = { rn \ m E hyperplane(g , fc, axis-set) A context-flag[m] = 1} 

dest\k] <— ( max source\m] 

\mec k 

where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-f-max operation combines source fields by performing a floating-point max¬ 
imum operation. 
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MULTISPREAD-MAX 


MULTISPREAD-S-MAX 

The destination field in every selected processor receives the largest of the signed integer 
source fields from all processors in the same hyperplane through the news grid. 


Formats CM:multispread-s-max-lL dest, source , axis-mask, len 


Operands 


dest 

source 

axis-mask 

len 


The signed integer destination field. 

The signed integer source field. 

An unsigned integer, the mask indicating a set of news axes. 

The length of the dest and source fields. This must be no smaller 
than 2 but no greater than CM:*maximum-integer-Iength*. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ =1 then 

let g = geometry {current-vp-set) 
let r = rank(g) 

let axis-set = {m|0<m<rA (axis-mask (m) — 1) } 

let Ck = { m | m £ hyperplane(g, k, axis-set) A context-flag[m] = 1} 

dest[k] 4— ^max *owce[m]^ 
where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-s-max operation combines source fields by performing a signed integer max- 
imum operation. 
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MULTISPREAD-MAX 


MULTISPREAD-U-MAX 

The destination field in every selected processor receives the largest of the unsigned integer 
source fields from all processors in the same hyperplane through the NEWS grid. 


Formats CM:multispread-u-max-lL dest , source , axis-mask , len 


Operands 


dest 

source 

axis-mask 

len 


The unsigned integer destination field. 

The unsigned integer source field. 

An unsigned integer, the mask indicating a set of news axes. 

The length of the dest and source fields. This must be non-negative 
and no greater than CM:*maximum-integer-length*. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 

let g = geometry [current-vp-set) 
let r = rank(g) 

let axis-set — {m \ 0 < m < r A (axis-mask(m) = 1)} 

let Ck = { m | m £ hyperplane(g , fc, axis-set) A context-flag[m] = 1 } 

destlk] *— ( max sourcelm] 

\mec k 

where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-u-max operation combines source fields by performing an unsigned integer 
maximum operation. 
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MULTISPREAD-MIN 


MULTISPREAD-F-MIN 

The destination field in every selected processor receives the smallest of the floating-point 
source fields from all processors in the same hyperplane through the news grid. 


Formats CM:multispread-f-min-lL dest, source, axis-mask, s, e 


Operands 


dest 
source 
axis-mask 
s, e 


The floating-point destination field. 

The floating-point source field. 

An unsigned integer, the mask indicating a set of news axes. 

The significand and exponent lengths for the dest and source fields. 
The total length of an operand in this format is s + e + 1. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two floating-point fields are identical if they have the same address and the 
same format. 


Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry (current-vp-set) 
let r = rank(g) 

let axis-set = {m|0<ra<rA (axis-mask (m) = 1)} 

let C k = {m \ m e hyperplane(g, k, axis-set) A context-flag[m] — 1} 

destlk] 4— ( min sourcelm] 

\mec k 

where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CMimultispread-f-min operation combines source fields by performing a floating-point mini¬ 
mum operation. 
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MULTISPREAD-MIIM 


MULTISPREAD-S-MIN 

The destination field in every selected processor receives the smallest of the signed integer 
source fields from all processors in the same hyperplane through the news grid. 


Formats CM:multispread-s-min-lL dest, source , axis-mask , len 


Operands 


dest 

source 

axis-mask 

len 


The signed integer destination field. 

The signed integer source field. 

An unsigned integer, the mask indicating a set of NEWS axes. 

The length of the dest and source fields. This must be no smaller 
than 2 but no greater than CM:*maximum-integer-length*. 


Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry (current-vp-set) 
let r = rank(g) 

let axis-set = {m \ 0 < m < r A ( axis-mask(m) = 1)} 

let Ck = { m | m E hyperplane(g , fc, axis-set) A context-flag[m\ = 1 } 

destlk] <— ( min source [ml 
Vm€C fc 

where hyperplane is as defined on page 36. 


See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-s-min operation combines source fields by performing a signed integer min¬ 
imum operation. 
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MULTISPREAD-MIN 


MULTISPREAD-U-MIN 

The destination field in every selected processor receives the smallest of the unsigned integer 
source fields from all processors in the same hyperplane through the news grid. 

Formats CM:multispread-u-min-lL dest, source , axis-mask , len 

Operands dest The unsigned integer destination field. 

source The unsigned integer source field. 

axis-mask An unsigned integer, the mask indicating a set of news axes. 

len The length of the dest and source fields. This must be non-negative 

and no greater than CM:*maximum-integer-Iength*. 

Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k ] = 1 then 

let g = geometry [current-vp-set) 
let r = rank(g) 

let axis-set = {m|0<m<rA (axis-mask (m) = 1)} 
let Ck = { rn | m £ hyperplane(g , k, axis-set) A context-flag[m] = 1} 

dest\k] <— ( min sourcelm] 

\m<ZC k 

where hyperplane is as defined on page 36. 

See section 5.16 on page 34 for a general description of multispread operations. The 
CM:multispread-u-min operation combines source fields by performing an unsigned integer 
minimum operation. 
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MY-NEWS-COORDINATE 


MY-NEWS-COORDINATE 

Stores the NEWS coordinate of each selected processor along a specified NEWS axis into 
a destination field within that processor. 


Formats CM:my-news-coordinate-lL dest, axis , dlen 


Operands dest 
axis 

dlen 


The unsigned integer destination field. 

An unsigned integer immediate operand to be used as the number 
of a NEWS axis. 

The length of the dest field. This must be non-negative and no 
greater than CM:*maximum-integer-length*. 


Context This operation is conditional. The destination may be altered only in proces 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry (current-vp-set) 

dest[k] *— extract-news-coordinate(g^axis^k) 

where extract-news-coordinate is as defined on page 33. 


This function calculates, within each selected processor, the news coordinate of that pro¬ 
cessor along a specified NEWS axis. 
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MY-SEND-ADDRESS 

Stores the send-address of each selected processor into a destination field in that processor. 
Formats CM:my-send-address dest 

Operands dest The unsigned integer destination field. This must be no less than 

the value returned by CM: geometry-send-address-length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
dest[k] *— k 


This function stores into the dest field, within each selected processor, the send-address of 
that processor. 
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POWER 


The sourcel field (the base) is raised to the power source2 (the exponent). 

The result is stored into the memory field dest The various operand formats allow operands 
to be either memory fields or constants; in some cases the destination field initially contains 
one source operand. 
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S-S-POWER 


Raises a signed integer to a signed integer power. 


Formats CM:s-s-power-3-3L dest, sourcel, source2, dlen, slenl, slen2 

CM:s-s-power-2-lL de$t/sourcel, source2 , Zen 

CM:s-s-power-3-lL dest, sourcel, source2 , Zen 

CM:s-s-power-constant-2-lL dest! sourcel, source2-value, len 
CM:s-s-power-constant-3-lL dest, sourcel, source2-value , Zen 
CM:s-s-power-constant-3-2l dest, sourcel, source2-value , dZen, sZen 

Operands dest The signed integer destination field. 

sourcel The signed integer base field. 
source2 The signed integer exponent field. 

source2-value A signed integer immediate operand to be used as the second 
source. 

Zen The length of the dest, sourcel , and source2 fields. This must 

be no smaller than 2 but no greater than CM: ♦maximum-integer- 
length*. 

dZen For CM: s-s-power-3-3L and CM:s-s-power-constant-3-2L, the length 

of the dest field. This must be no smaller than 2 but no greater 
than CM:*maximum-integer-length*. 

sZen For CM:s-s-power-constant-3“2L, the length of the sourcel field. 

This must be no smaller than 2 but no greater than CM: ♦maximum- 
integer-length*. 

slenl For CM:s-s-power-3-3L, the length of the sourcel field. This must 
be no smaller than 2 but no greater than CM:♦maximum-integer- 
length*. 

slen2 For CM:s-s-power-3-3L, the length of the source2 field. This must 
be no smaller than 2 but no greater than CM:♦maximum-integer- 
length*. 

Overlap The fields sourcel and source2 may overlap in any manner. Each of them, 
however, must be either disjoint from or identical to the dest field. Two integer 
fields are identical if they have the same address and the same length. It is 
permissible for all the fields to be identical. 

Flags overflow-flag is set if the result cannot be represented in the destination field; 

otherwise it is cleared. 
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POWER 


Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
if source2[k] < 0 then 
dest[k] 0 

else if source2[k] = 0 then 
dest[k] <— 1 
else 

dest[k] <- (sourcel[k})‘ ource2 W 

if (overflow occurred in processor k) then overflow-flag[k] <— 1 
else overflow-fla^k] 0 


The sourcel field (the base) is raised to the power source2 (the exponent). If the exponent 
is negative, the result is always 0; if the exponent is zero, the result is always 1. 

The result is stored into the memory field dest The various operand formats allow operands 
to be either memory fields or constants; in some cases the destination field initially contains 
one source operand. 

The overflow-flag may be altered by these operations. If overflow occurs, then the destina¬ 
tion field will contain as many of the low-order bits of the true result as will fit. 

The constant operand sourcel-value or source2-value should be a signed integer front-end 
value. The operation is performed properly in all cases; the constant need not be repre¬ 
sentable in the number of bits specified by len. 
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POWER-UP 

This operation resets the Nexus, causing all front-end computers to become logically de¬ 
tached from the Connection Machine system. 


Formats CM: power-up 


Context This operation is unconditional. It does not depend on the context-flag . 


This function resets the state of the Nexus, causing all front-end computers to become 
logically detached from the Connection Machine system. When a Connection Machine 
system is first powered up or is to be completely reset for other reasons, this is the first 
operation to perform. Any of the front-end computers may be used to do it. 

If users on other front-end computers are actively using the Connection Machine system, 
their computations will be disrupted. Normally all the front-end computers are connected 
not only through the Connection Machine Nexus but also through some sort of commu¬ 
nications network; a front end that executes CM:power-up will attempt to send messages 
through this network to the other front-end computers on the same Nexus indicating that 
a CM:power-up operation is being performed. 
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RANK 


F-RANK 

The destination field in every selected processor receives the rank of that processor’s key 
among all keys in the scan set for that processor. 


Formats CM:f-rank-2L dest, source , axis , dZen, s, e, 

direction , smode , sbit 

The unsigned integer destination field. 

The floating-point source field. This is the sort key. 

An unsigned integer immediate operand to be used as the number 
of a NEWS axis. 

The length of the dest field. This must be non-negative and 
no greater than CM:*maximum-integer-length*. This must be no 
larger than the value returned by CM:geometry-coordinate-length. 

The significand and exponent lengths for the source field. The 
total length of an operand in this format is s + e + 1. 

Either :upward or :downward. 

Either :none, : start-bit, or : segment-bit. 

The segment bit or start bit (a one-bit field). If smode is :none 
then this may be CM:*no-field*. 

Overlap The fields source and sbit may overlap in any manner. However, the source 
and sbit fields must not overlap the dest field. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Operands dest 

source 

axis 

dlen 

s, e 

direction 

smode 

sbit 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry(current-vp-set) 
let Sk = scan-set(g,k , axis , direction , smode , sbit) 
case direction of 
: upward: 

let Lk = { m | m € Sk A (( source[m] < source[k ]) V ( source[m ] = source[k] 

: downward: 

let Lk = { m | m £ Sk A (( source[m] > source[k ]) V ( source[m ] = source[k\ 
dest[k\ \Lk\ 

where scan-set is as defined on page 37. 
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See section 5.16 on page 34 for a general description of scan sets and the effect of the axis , 
direction, smode, and shit operands. 

This operation determines the ordering necessary to sort the source fields within each scan 
set. It does not not actually move the data so as to sort it, but merely indicates where the 
data should be moved so as to sort it. 

In more detail: The dest field in each selected processor receives, as an unsigned integer, 
the rank of that processor’s key within the set of keys in the scan set for that processor. 
The smallest key has rank 0, the next smallest has rank 1, and so on; the largest key has 
rank n — 1 where n is the number of processors in the scan set. This rank may be used to 
calculate a send address a CM: send operation may then be used to put the data into sorted 
order. (An advantage of decoupling the rank determination from the reordering process is 
that the data to he moved may be much larger than the key that determines the ordering, 
and indeed it may be desirable to reorder the other data but not the key itself. In this way 
ranking and reordering each need operate only on the relevant data.) 
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RANK 


S-RANK 

The destination field in every selected processor receives the rank of that processor’s key 
among all keys in the scan set for that processor. 


Formats CM:s-rank-2L dest , source , axis , cZZen, sZera, 

direction , smode , s6it 

The unsigned integer destination field. 

The signed integer source field. This is the sort key. 

An unsigned integer immediate operand to be used as the number 
of a NEWS axis. 

The length of the dest field. This must be non-negative and 
no greater than CM:*maximum-integer-length*. This must be no 
larger than the value returned by CM:geometry-coordinate-length. 

The length of the source field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 

Either :upward or :downward. 

Either :none, :start-bit, or : segment-bit. 

The segment bit or start bit (a one-bit field). If smode is :none 
then this may be CM:*no-field*. 

Overlap The fields source and sbit may overlap in any manner. However, the source 
and sbit fields must not overlap the dest field. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Operands dest 

source 

axis 

dlen 

slen 

direction 

smode 

sbit 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

let g = geometry(current-vp-set) 
let Sk = scan-set(g , &, axis, direction , smode, sbit) 
case direction of 
: upward: 

let ifc = { m | m G 5^ A (( source[m\ < source[k ]) V ( source[m] = source[k] 
: downward: 

let Lk = { m | m G Sk A (( source[m ] > source[k ]) V ( source[m ] = source[k\ 
dest[k\ <— \Lk\ 

where scan-set is as defined on page 37. 
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RANK 




See section 5.16 on page 34 for a general description of scan sets and the effect of the axis , 
direction , smode, and sbit operands. 

This operation determines the ordering necessary to sort the source fields within each scan 
set. It does not not actually move the data so as to sort it, but merely indicates where the 
data should be moved so as to sort it. 

In more detail: The dest field in each selected processor receives, as an unsigned integer, 
the rank of that processor’s key within the set of keys in the scan set for that processor. 
The smallest key has rank 0, the next smallest has rank 1, and so on; the largest key has 
rank n — 1 where n is the number of processors in the scan set. This rank may be used to 
calculate a send address a CM:send operation may then be used to put the data into sorted 
order. (An advantage of decoupling the rank determination from the reordering process is 
that the data to be moved may be much larger than the key that determines the ordering, 
and indeed it may be desirable to reorder the other data but not the key itself. In this way 
ranking and reordering each need operate only on the relevant data.) 
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U-RANK 

The destination field in every selected processor receives the rank of that processor’s key 
among all keys in the scan set for that processor. 


Formats CM:u-rank-2L dest , source , axis , d/en, sZen, 

direction , smode , s&it 

The unsigned integer destination field. 

The unsigned integer source field. This is the sort key. 

An unsigned integer immediate operand to be used as the number 
of a NEWS axis. 

The length of the dest field. This must be non-negative and 
no greater than CM:*maximum-integer-length*. This must be no 
larger than the value returned by CM:geometry-coordinate-length. 

The length of the source field. This must be non-negative and no 
greater than CM:*maximum-integer-length*. 

Either :upward or :downward. 

Either :none, :start-bit, or : segment-bit. 

The segment bit or start bit (a one-bit field). If smode is :none 
then this may be CM:*no-field*. 

Overlap The fields source and shit may overlap in any manner. However, the sbit field 
must not overlap the dest field, and the field source must be either disjoint 
from or identical to the dest field. Two integer fields are identical if they have 
the same address and the same length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Operands dest 

source 

axis 

dlen 

slen 

direction 

smode 

sbit 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ — 1 then 

let g = geometry (current-vp-set) 
let Sk = scan-set(g y k, axis , direction, smode, sbit) 
case direction of 
: upward: 

let Lh = { m | m E Sk A ((source[m] < source[k ]) V ( source[m ] = source[k\ / 
: downward: 

let Lk = { m | m 6 Sk A (( source[m ] > source[k\) V (source[m] = source[k ] / 
dest[k] <— \Lu\ 

where scan-set is as defined on page 37. 
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See section 5.16 on page 34 for a general description of scan sets and the effect of the axis, 
direction, smode, and sbit operands. 

This operation determines the ordering necessary to sort the source fields within each scan 
set. It does not not actually move the data so as to sort it, but merely indicates where the 
data should be moved so as to sort it. 

In more detail: The dest field in each selected processor receives, as an unsigned integer, 
the rank of that processor’s key within the set of keys in the scan set for that processor. 
The smallest key has rank 0, the next smallest has rank 1, and so on; the largest key has 
rank n — 1 where n is the number of processors in the scan set. This rank may be used to 
calculate a send address a CM: send operation may then be used to put the data into sorted 
order. (An advantage of decoupling the rank determination from the reordering process is 
that the data to be moved may be much larger than the key that determines the ordering, 
and indeed it may be desirable to reorder the other data but not the key itself. In this way 
ranking and reordering each need operate only on the relevant data.) 
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S-F-ROUND 


Converts floating-point source field values to signed integer values by rounding to the nearest 
integer. 


Formats 

CM:s-f-rou 

nd-2-2L dest, source, dlen, s, e 

Operands 

dest 

The signed integer destination field. 


source 

The floating-point source field. 


len 

The length of the dest field. This must be no smaller than 2 but 
no greater than CM:*maximum-integer-length*. 


s, e 

The significand and exponent lengths for the source field. The 
total length of an operand in this format is s + e + 1. 


Overlap The fields dest and source must not overlap in any manner. 

Flags overflow-flag is set if the result cannot be represented in the dest field; other¬ 

wise it is cleared. 

Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
let v = source[k] 


if v > v + 

| then 

dest[k] 4— [v\ 

else if v < 

v + | then 

dest[k] <— [t>] 

else if even 

(|t>J) then 

dest[k] 4 - 

- M 

else 


dest [A] 4- 

-M 


if (overflow occurred in processor k) then overflow-flag[k] 4 — 1 

The source field, treated as a floating-point number, is rounded to the nearest integer (to 
the nearest even integer if its value is equal to an integer plus |). The result is stored into 
the dest field as a signed integer. 
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U-ROUND 

The quotient of two unsigned integer source values, rounded to the nearest integer, is placed 
in the destination field. Overflow is also computed. 


Formats CM:s-round-3-3L dest, source1, sourceZ , dlen, slenl, slen2 


Operands dest The unsigned integer quotient field. 


sourcel The unsigned integer dividend field. 

source2 The unsigned integer divisor field. 

dlen The length of the dest field. This must be non-negative and no 

greater than CM:*maximum-integer-length*. 

slenl The length of the sourcel field. This must be non-negative and no 
greater than CM:*maximum-integer-Iength* 8 

slen2 The length of the source2 field. This must be non-negative and no 
greater than CM:*maximum-integer-length*. 


Overlap The fields sourcel and source2 may overlap in any manner. Each of them, 
however, must be either disjoint from or identical to the dest field. Two integer 
fields are identical if they have the same address and the same length. It is 
permissible for all the fields to be identical. 


Flags overflow-flag is set if the quotient cannot be represented in the destination 

field; otherwise it is cleared. 

test-flag is set if the divisor is zero; otherwise it is cleared. 

Context This operation is conditional. The destination and flags may be altered only 
in processors whose context-flag is 1. 


Definition 




if context-flag[k] = 1 then 
, , sourcel \k] 
le * * = sourcei\k \ 
t/+ |J then 

W ' ' 


if v > 
dest 


u.cs&[#v;j *— [vj 

else if v < + jJ then 


dest[k] <— [V| 
else if even{\y\) then 
dest[k\ <— [v\ 
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ROUND 


else 

dest[k] <— [v] 

if (overflow occurred in processor k) then over flow-flag\k] <— 1 


The unsigned integer sourcel operand is divided by the unsigned integer source2 operand. 
The mathematical quotient, rounded to the nearest integer (or to whichever of two equally 
near neighbors is even) is stored into the unsigned integer memory field dest. 

The overflow-flag and test-flag may be affected by these operations. If overflow occurs, then 
the destination field will contain as many of the low-order bits of the true result as will fit. 
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U-F-ROUND 

Converts the floating-point source field values to unsigned integer values, which are stored 
in the destination field. 


Formats CM:u-f-round- 2 - 2 L dest, source , dlen, s, e 


Operands dest 


The unsigned integer destination field. 


source The floating-point source field. 


len The length of the dest field. This must be non-negative and no 

greater than CM:*maximum-integer-length*. 

s, e The significand and exponent lengths for the source field. The 

total length of an operand in this format is s + e + 1. 


Overlap The fields dest and source must not overlap in any manner. 

Flags overflow-flag is set if the result cannot be represented in the dest field; other¬ 

wise it is cleared. 


Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 
if dest > [source\ then 
dest 4 - [source J 
else if dest < [source\ then 
dest 4 - [ source ] 
else if eveni^sourcef) then 
dest \_source\ 
else 

dest 4 - [ source ] 

if (overflow occurred in processor k) then overflow-flag[k] 4- 1 


The source field, treated as a floating-point number, is rounded to the nearest integer (to 
the nearest even integer if its value is equal to an integer plus |), which is stored into the 
dest field as an unsigned integer. 
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RESET-TIMER 


RESET-TIMER 

For the C/Paris and Fortran/Paris interfaces, resets the timing facility before timing other 
operations. 


Formats CM: reset-timer 


Context This operation is unconditional. It does not depend on the context-flag. 


The function CM: reset-timer is used in the C/Paris and Fortran/Paris interfaces to reset the 
facility for timing the execution of other operations on the Connection Machine system. 

One should first call CM:reset-timer to clear the timing counters. Subsequently one may 
alternately call CM:start-timer and CM: stop-timer. The amounts of real time and run time 
between a start and a stop are accumulated into the counters. One may start and stop 
the clocks repeatedly. Every time CM:stop-timer is called, it returns a structure of type 
CM_timeval_t that contains time accumulated between all start/stop call pairs since the last 
call to CM:reset-timer. 

The timing facility is provided in the Lisp/Paris interfaces through the CM:time macro. 
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SCALE 


F-S-SCALE 

In each selected processor, multiplies a floating-point number by a specified power of two 
and stores the result into the destination. 

Formats CM:f-s-scale-2-2L destlsourcel , source2 , slen2> s, e 

CM:f-s-scaIe-3-2L dest^ source i, source2 y slen2 , s, e 

CM:f-s-scaIe-constant-2-lL dest! source!^ source2-value , 5, e 
CM:f-s-scale-constant-3-1 L dest , sourcel , source2-value , 5, e 

Operands dest The floating-point destination field. 

sourcel The floating-point first source field. This is the quantity to be 
scaled. 

source2 The signed integer second source field. This is the base-2 logarithm 
of the scale factor. 

source2-value A signed integer immediate operand to be used as the second 
source. 

The significand and exponent lengths for the dest and sourcel 
fields. The total length of an operand in this format is s + e + 1. 

The length of the source2 field. This must be no smaller than 2 
but no greater than CM:*maximum-integer-length*. 

Overlap The fields sourcel and source2 may overlap in any manner. However, the 
source2 field must not overlap the dest field, and the field sourcel must be 
either disjoint from or identical to the dest field. Two floating-point fields are 
identical if they have the same address and the same format. 

Flags overflow-flag is set if floating-point overflow occurs; otherwise it is unaffected. 

Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 

Definition For every virtual processor k in the current-vp-set do 
if context-flag[k\ = 1 then 

dest[k\ [^source./ [k] x 2* ottrce5 ^J 

if (overflow occurred in processor k) then overflow-flag[k] <— 1 

The operand sourcel is scaled by the power of two specified by source2 . 

The result is stored into the memory field dest The various operand formats allow operands 
to be either memory fields or constants; in some cases the destination field initially contains 
one source operand. 


s, e 

slen2 
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SEND-ASET32-ADD 


SEND-ASET32-U-ADD 

Sends a message from every selected processor to a specified destination processor and stores 
it there, as if by aset32, in an array. Each selected processor may specify any processor as 
the destination, including itself. A destination processor may receive messages even if it is 
not selected. All incoming messages are combined with the destination array element using 
unsigned integer addition. 

Formats CM:send-aset32-u-add-2L array, send-address , source , index, 

sZen, index-len, index-limit 

Operands array The destination array field. 

send-address The send-address field. For each processor, this indicates to 
which processor a message is sent. 

source The source field. 

index The unsigned integer index into the array field. This is used as a 
per-processor index into array. It specifies portions of the array 
memory area in increments of slen. 

slen The length of the source field. This must be a multiple of 32. 

index-len The length of the index field. This must be non-negative and no 

greater than CM: *maximum-integer-length*. 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index. This is taken as the extent 
of the destination array. 

Overlap The send-address and source may overlap in any manner. The dest field may 
overlap with the send-address or source but, if it does, then it is forbidden to 
send a message to a selected processor. In other words, the dest may overlap 
with the send-address or source only if at most one of them will be used within 
each processor. 

Context This operation is conditional, but whether a message is sent depends only on 
the context-flag of the originating processor; the message, once transmitted 
to the receiving processor, is combined with the the field indicated by array 
regardless of the context-flag of the receiving processor. 


Definition For every virtual processor k in the current-vp-set do 

let Sk = { m | m 6 current-vp-set A context-flag[m] = 1 A send-address[m] = k } 
for every processor k f in Sk do 
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SEND-ASET32-ADD 




if index[k'] < index-limit then 

let r = geometry-total-vp-ratio(geometry(current-vp-set)) 
let m = |^J mod 32 
let i = index[k '] 

for all j such that 0 < j < dlen do 

let tempk(j) = array[k - mx r + (j mod 32) X r](32 x (i + |^J)) 
let sum i, = tempk + source[k'] 
for all j such that 0 < j < dlen do 

array[k - mx r + (j mod 32) x r]{32 X (i + [^J)) <— sumk(j) 

else 

(error) 


For every selected processor p,, a message length bits long is sent from that processor to 
the processor pd whose send address is stored at location send-address in the memory of 
processor p s . The message is taken from the source field within processor p, and is stored 
into an array element within processor pd. Note that in each case the array element to be 
modified in processor pd is determined by the value of index within p s , not the value within 
Pd • 

The CM:send-aset32-u-add operation combines incoming messages with unsigned integer 
addition. To receive the sum of only the messages, the destination array should first be 
cleared in all processors that might receive a message. 
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SEND-ASET32-LOGIOR 


SEND-ASET32-LOGIOR 

Sends a message from every selected processor to a specified destination processor and stores 
it there, as if by aset32, in an array. Each selected processor may specify any processor as 
the destination, including itself. A destination processor may receive messages even if it is 
not selected. All incoming messages are combined with the destination array element using 
bitwise logical inclusive or. 


Formats CM:send-aset32-logior-2L array, send-address , source , index , 

sZen, index-len , index-limit 

Operands array The destination array field. 

send-address The send-address field. For each processor, this indicates to 
which processor a message is sent. 

source The source field. 

index The unsigned integer index into the array field. This is used as a 
per-processor index into array . It specifies portions of the array 
memory area in increments of slen. 

slen The length of the source field. This must be a multiple of 32. 

index-len The length of the index field. This must be non-negative and no 

greater than CM: ♦maximum-integer-length*. 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index . This is taken as the extent 
of the destination array. 

Overlap The send-address and source may overlap in any manner. The dest field may 
overlap with the send-address or source but, if it does, then it is forbidden to 
send a message to a selected processor. In other words, the dest may overlap 
with the send-address or source only if at most one of them will be used within 
each processor. 

Context This operation is conditional, but whether a message is sent depends only on 
the context-flag of the originating processor; the message, once transmitted 
to the receiving processor, is combined with the the field indicated by array 
regardless of the context-flag of the receiving processor. 


Definition For every virtual processor k in the current-vp-set do 

let Sk = { m | m 6 current-vp-set A context-flag[m] = 1 A send-address[m] = k } 
for every processor k ' in Sk do 
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SEND-ASET32-LOGIOR 




if index[k'] < index-limit then 

let r = geometry-total-vp-ratio(geometry(current-vp-set )) 
let m = j^|J mod 32 
let i = index[k'] 

for all j such that 0 < j < dlen do 
let q = k — mxr + (j mod 32) x r 
Iet6 = 32x(i+[^J) 

array[q](b) <— array[q](b) V source[k'](j) 

else 

(error) 


For every selected processor p 3 , a message length bits long is sent from that processor to 
the processor pa whose send address is stored at location send-address in the memory of 
processor p t . The message is taken from the source field within processor p, and is stored 
into an array element within processor pd . Note that in each case the array element to he 
modified in processor pd is determined by the value of index within p t , not the value within 
Pd- 

The CM:send-aset32-logior operation combines incoming messages with a bitwise logical in¬ 
clusive or operation. To receive the logical inclusive OR of only the messages, the destination 
array should first be cleared in all processors that might receive a message. 
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SEND-ASET32-OVERWRITE 


SEND-ASET32-OVER WRITE 

Sends a message from every selected processor to a specified destination processor and stores 
it there, as if by aset32, in an array. Each selected processor may specify any processor as 
the destination, including itself. A destination processor may receive messages even if it is 
not selected. If a processor receives more than one message destinated for the same array 
element, then one is stored in that array element and the rest are discarded. 

Formats CM:send-aset32-overwrite-2L array, send-address , source , index, 

slen, index-len, index-limit 

Operands array The destination array field. 

send-address The send-address field. For each processor, this indicates to 
which processor a message is sent. 

source The source field. 

index The unsigned integer index into the array field. This is used as a 
per-processor index into array. It specifies portions of the array 
memory area in increments of slen. 

slen The length of the source field. This must be a multiple of 32. 

index-len The length of the index field. This must be non-negative and no 

greater than CM:*maximum-integer-length*. 

index-limit An unsigned integer immediate operand to be used as the 
exclusive upper bound for the index. This is taken as the extent 
of the destination array. 

Overlap The send-address and source may overlap in any manner. The dest field may 
overlap with the send-address or source but, if it does, then it is forbidden to 
send a message to a selected processor. In other words, the dest may overlap 
with the send-address or source only if at most one of them will be used within 
each processor. 

Context This operation is conditional, but whether a message is sent depends only on 
the context-flag of the originating processor; the message, once transmitted 
to the receiving processor, is combined with the the field indicated by array 
regardless of the context-flag of the receiving processor. 


Definition For every virtual processor k in the current-vp-set do 

let Sk = { Tn | m 6 current-vp-set A context-flag[m] — 1 A send-address[m ] = k } 

let k ' = choice(Sk) 
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SEND-ASET32-OVERWRITE 

if index[k'] < index-limit then 

let r = geometry-total-vp-ratio(geometry(current-vp-set)) 
let m = mod 32 
let i = index[k'\ 

for all j such that 0 < j < dlen do 

array[k — m x r + (j mod 32) x r](32 x (i + [^J)) <— source[k'](j) 

else 

(error) 


For every selected processor p s , a message length bits long is sent from that processor to 
the processor pd whose send aid dr ess is stored at location send-address in the memory of 
processor p*. The message is taken from the source field within processor p t and is stored 
into an array element within processor p<*. Note that in each case the array element to he 
modified in processor pd is determined by the value of index within p,, not the value within 
Pd • 

The CM: send-aset32-overwrite operation will store one of the messages sent to a particular 
array element, discarding all other messages as well as the original contents of that array 
element in the receiving processor. 
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SEND-TO-NEWS 


SEND-TO-NEWS 


Each processor sends a message to a neighboring processor along a specified NEWS axis. 


Formats CM:send-to-news-lL dest, source , axis , direction , len 

CM:send-to-news-always-lL dest , source , azis, direction , Zen 


Operands cZest 

source 

axis 

direction 

len 


The destination field. 

The source field. 

An unsigned integer immediate operand to be used as the number 
of a NEWS axis. 

Either :upward or :downward. 

The length of the dest and source fields. This must be non-negative 
and no greater than CM:*maximum-integer-length*. 


Overlap 


The source field must be either disjoint from or identical to the dest field. Two 
bit fields are identical if they have the same address and the same length. 


Context This operation is conditional, but whether data is copied depends only on the 
context-flag of the originating processor; the message, once transmitted to the 
receiving processor, is stored into the the field indicated by dest regardless of 
the context-flag of the receiving processor. 

Note that in the conditional case the storing of data depends only on the 
context-flag of the processor sending the data, not on the context-flag of the 
processor receiving the data. 


Definition For every virtual processor k in the current-vp-set do 
if (always or context-flag[k] = 1) then 
let g = geometry [current-vp-set) 
dest[news-neighbor(g, &, axis, direction)] <— source[k] 

The source field in each processor is stored into the dest field of that processor’s neighbor 
along the news axis specified by axis in the direction specified by direction. 

If direction is :upward then each processor stores data into the neighbor whose news coor¬ 
dinate is one greater, with the processor whose coordinate is greatest storing data into the 
processor whose coordinate is zero. 

If direction is : downward then each processor stores data into the neighbor whose news 
coordinate is one less, with the processor whose coordinate is zero storing data into the 
processor whose coordinate is greatest. 
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SEND-WITH-ADD 


SEND-WITH-F-ADD 

Sends a message from every selected processor to a specified destination processor. Each 
selected processor may specify any processor as the destination, including itself. A des¬ 
tination processor may receive messages even if it is not selected, and all the destination 
processors may be in a VP set different from the VP set of the source processors. Messages 
are all delivered to the same address within each receiving processor. All incoming messages 
are combined with the destination field using floating-point addition. 


Formats CM:send-with-f-add-lL dest, send-address, source, s, e, notify 


Operands dest The floating-point destination field. 

send-address The send-address field. For each processor, this indicates to 
which processor a message is sent. 

source The floating-point source field. 

s, e The significand and exponent lengths for the dest and source fields. 

The total length of an operand in this format is s + e + 1. 

notify The notification bit (a one-bit field). This argument may be 
CM :*no-field* if no notification of message receipt is desired. 

Overlap The send-address and source may overlap in any manner. The dest field may 
overlap with the send-address or source but, if it does, then it is forbidden to 
send a message to a selected processor. In other words, the dest may overlap 
with the send-address or source only if at most one of them will be used within 
each processor. 


Context This operation is conditional, but whether a message is sent depends only on 
the context-flag of the originating processor; the message, once transmitted 
to the receiving processor, is combined with the dest field regardless of the 
context-flag of the receiving processor. The notify bit may be altered in any 
processor regardless of the value of the context-flag. 


Definition For every virtual processor k in the current-vp-set do 

let S k = { m | m £ current-vp-set A context-flag[m] = 1 A send-address[m] = k } 
if |5fc| = 0 then 

if notify[k\ ^ CM: *no-field* then notify[k\ <— 0 
else 

if notify[k] ^ CM:*no-field* then notify[k] <— 1 

dest[k] 4 — dest[k\ + I source[m] j 
\meSk ) 
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STORE-FLAG 


STORE-flag 

Conditionally stores a flag bit into memory. 


Formats CM: store-test dest 

CM: store-test-al ways dest 

CM: store-overflow dest 

C M: stor e-overf Iow- a I ways dest 

Operands dest The destination bit (a one-bit field). 


Context The non-always operations are conditional. The destination may be altered 
only in processors whose context-flag is 1. 

The always operations are unconditional. The destination may be altered 
regardless of the value of the context-flag . 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 
dest[k] <— flag[k ] 

where flag is test-flag or overflow-flag , as appropriate. 


Within each processor, the indicated flag for that processor is stored into memory. 
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STRUCTURE-ARRAY-FORMAT 


FE-STRUCTURE-ARRAY-FORMAT 

This instruction returns an array format descriptor for a particular slot in an array of 
structures. A format descriptor may be passed to any array transfer instruction to specify 
a front-end array format, although this is not required. See also CM: fe-array-format and 
CM: fe-packed-array- format. 

This instruction is not provided for the Lisp interface to Paris. 


Formats result «— CM:fe-$tructure-array-format cm-element-byte-size, 

structure-byte-size 

Operands cm-element-byte-size A signed integer immediate operand to be used as the 

number of bytes each Connection Machine element occupies in the 
front-end array. This must be a power of two between 1 and 16. 

structure-byte-size A signed integer immediate operand to be used as the 

length of the front-end structure in bytes. This may be any positive 
integer. 

Result The array format descriptor specified. 


Context 


This is a front-end operation. It does not depend on the value of the context ■ 
flag . 


The return value is a format descriptor for a front-end array of structures. Such a format 
descriptor can be passed to any of the CM array transfer instructions in order to allow 
transfers in either direction between CM fields and a front-end array of structures. If this 
is done, one CM element per selected processor is copied into, or receives data from, the 
specified slot across an array of structures on the front end. 

Values for both cm-element-byte-size and cm-structure-byte-size may be obtained by calls 
to sizeof(...). 

The value of cm-element-byte-size specifies the length of the structure slot in bytes. It also 
defines the unit of measure for the fe-offset-vector argument to the CM: read-from-news-array 
and CM: write-to-news-array instructions. 

The value of structure-byte-size specifies the length of the entire stucture in bytes. It also 
defines the unit of measure for the argument fe-dimension-vector to the CM'.read-from-news- 
array and CM:write-to-news-array instructions. 

If a slot other than the first slot in the front-end structure is the destination of a CM: read- 
from-news-array or the source for a CM: write-to-news-array transfer instruction, then a pointer 
to that slot must be provided as the value of front-end-array. This is a bit tricky. The 
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TO-GRAY-CODE 


U-TO-GRAY-CODE 

Converts an unsigned binary integer to a bit string representing a Gray-coded integer value. 

Formats CM:u-to-gray-code-l-lL dest/source, len 
CM:u-to-gray-code-2-lL dest, source , len 

Operands dest The destination field. 

source The unsigned integer source field. 

len The length of the dest and source fields. This must be non-negative 

and no greater than CM:*maximum-integer-length*. 

Overlap The source field must be either disjoint from or identical to the dest field. 

Two integer fields are identical if they have the same address and the same 
length. 

Context This operation is conditional. The destination may be altered only in proces¬ 
sors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-flag[k] = 1 then 

dest[k](len — 1) <— source[k](len - 1) 
for j from len — 2 to 0 do 

dest[k](j) <— source[k](j) © source[k](j + 1) 

The source operand is an unsigned binary integer, and is converted to a bit-string value in 
a particular reflected binary Gray code. The position of that value in the standard Gray 
code sequence is the source . 

Note that the binary value 0 is always equivalent to a Gray code string that is all 0-bits. 


449 






TRANSPOSE32 


TRANSPOSE32 

Within each cluster of 32 physical processors, .for every group of 32 virtual processors in 
such a cluster, copies one 32-bit field to another. During this copying operation, transposes 
the data as a 32-by-32 bit matrix. Thus, each virtual processor receives one bit from the 
source v&lue of each virtual processor in its group of 32. 


Formats CM:transpose32-l'lL. dest/ source* len 

V e D-XVX^i U ' ; *.0 ' •••*?.’ . Jr. 

CM:transpose32-2-lL dest, source , len 

Operands source The source field. 

dest The destination field. 

len The length of the source and dest fields. This must be non-negative 

and no greater than CM: *maximum-integer-!ength*. This must be 
a multiple of 32. 


Overlap 



The source field must be either disjoint from or identical to the dest field. Two 
bit fields are identical if they have the same address and the same length. The 
fields dest and source may overlap in any manner. 


I Context This operation is unconditional. The destination may be altered regardless of 
the value of the context-flag . 


Definition For every virtual processor k in the current-vp-set do 
‘ if context-flag[k] = 1 then 

for all j such that 0 < j < dlen do 

dest[k\($/uree [32 r + (& mod r) + r(j mod 32)j ^32 [^J + 

where r is the value of CM: *virtual-to-physical-processor-ratio* and j is the bit 
position in each field. 

This instruction copies each 32-bit field to the corresponding 32-bit field within each virtual 
processor. In the course of copying the bits, it “transposes” them so that a 32-bit value 
lying entirely within the source field of one virtual processor is made to occupy a memory 
slice, that is, one bit in each of 32 virtual processors. The opposite is also true: the 32-bit 
value that ends up in the dest field of a virtual processor is made up of one bit from each 
of 32 virtual processors. Transposed data is said to be stored in a slicewise format. 

For the purposes of this instruction, the physical processors are divided into clusters of 32. 
Two processors are in the same cluster if their physical processor numbers agree in all but 
the five least significant bits. 
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TRANSPOSE32 


The virtual processors are similarly divided into groups of 32; a group of virtual processors 
consists of one virtual processor from each physical processor of a cluster, such that the vir¬ 
tual processors occupy the same physical memory locations within their respective physical 
processors. Thus, two virtual processors are in the same group if their virtual processor 
numbers agree in all but bit positions n through n + 4, where n is the number of virtual 
processors bits in each physical processor. G " 

' i. a: ' -r, KB J 


The CM:transpose32 operation may then be understood as taking the 32 32-bit source values 
from a group of 32 virtual processors as the rows of a 32-by-32 bit matrix, and then.storing - 
the columns of this matrix into the dest fields of these same virtual processors. 


; . ■ . . *h • ■ 

The process may be understood pictorially. Suppose that before the operation the memory 
of a group of 32 virtual processors looks like this: 


processor SOUTCe dest 



: ■ Ji ri 

•k i; \ ^ ai'(l 


Then, after the CM: transpose-data operation, it will look like this: 


. K 1. Kii.l ie'‘r 
■ . Oi.x ,:^T 
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TRANSPOSE32 


processor SOUTCe dest 



t is required to 

.WftTTOl S.TIil;f 

This instruction reorients processor data into a slicewise format that permits rapid, indirect 
field-addressing; AMemory Region cdntainihg transposed^"data^ihay TSe'Viewed either as a 
single, shared slicewise array or as a set of parallel sUkkwise arfays. (See the CM:aref32 and 
CM:ate£32“shared dictionary entries for ^ -desGription of thes^ data formatSo) Viewed as a 
shared slicewise array, this is especially useful for quickly constructing lookup tableSo 

*•] >.y>. •. iv ? n> £ ‘;.o> ^ ixr ':vr r 1 

Transposition is reversed by applying the CM:transppse32 instruction to a field already 

stored in the slicewise format. To preserve the correlation between processors and data, 
this instruction should not be used on slicewise data that was orginally stored by providing 
CM:aset32 or CM:aset32-shared with an index-limit other than 32. 

TU J fid') Oil f 7 • *'l 1,1 XW**Ti*X"< 48 eX XOuXXii 

*15x107 »;x >/- l ... ■ • '" • • * < * r.i v. a.- n x dw o * - .... 


Kiitft$Ledge o^th^^ih^iAM details of Coimettidd { ^il,cliinb"^l : 


uSe ^lilsdil scftitfce valued r^pfosehtb^dii 
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TRUNCATE 


F-F-TRUNCATE 


Rounds each source field value to the largest integral value not greater than that value ajnd 
stores the result as a floating-point number in the destination field. 


Formats CM:f-f-truncate-l-lL destlsource, s, e m 

CM:f-f-truncate- 2 -lL dest, source, s, e 

‘ I 

Operands dest The floating-poiht destination field. 

source ^he floating-poiiit source field. s v wmmmmmm mm 

. .s,JS _The_sigiufic&dUind-ejcbonentleng-thsibr-the<ie«t-and-sowee 4 ieldsi 

* • G fJif- tic' * ■' ft 

The total length of an operand in this format is s + e + 1 . 

Overlap The must ,b,e,^either ;disjoint from pr idg^ticat tOfjthe, 

^wo flpathig: r potn^ jfiplds are f iden.tical if. they have Ahfj. 
same format. 

i ; i .■- ' /■ 10?: •:■ _? ’-w?! noh nn jj ■ ■ ■ . T 

Context This operation is conditional. The destination may he altered only In pjrocessu 

sors whose context-flag is 1 . a ^ K ... -y-bw-i ;ui ' uuiv -sl^nie 

_1_^__________.,u„, ;;,ui,i,r,nu 


A 1 


Definition For every virtual processor k in the current-vp-set do 
if context- flag[k] = 1 then 

dest[k] <— sign(source) x [|source[fc]|j 


.ri r -isii: 
iu joqaufi'iT 

'> UKO'i 


The source field, treated as a floating-point number, is rounded to the nearest integer in 
the direction of zero, which is stored into the dest field as a floating-point number. 
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TRUNCATE 


if source2[k\ = 0 then 
dest[k\ <— (unpredictable) 
else 


desf[fc] 


sourcel 


source2 


if (overflow occurred in processor k) then overflow-flag[k] 
else overflow-fla^k] <— 0 


The unsigned integer sourcel operand is divided by the unsigned integer source2 operand. 
The floor of the mathematical quotient is stored into the unsigned integer memory field 
dest. The various operand formats allow operands to be either memory fields are constants; 
in some cases the destination field initially contains one source operand. 

The overflow-flag may be affected by these operations. If overflow occurs, then the desti¬ 
nation field will contain as many of the low-order bits of the true result as will fit. 

The constant operand sowrce2-value should be a signed integer front-end value. The op¬ 
eration is performed properly in all cases; the constant need not be representable in the 
number of bits specified by len. 
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TRUNCATE 


UP-TRUNCATE 

Rounds each source field value to the largest integer not greater than that value and stores 
the result as an unsigned integer in the destination field. 


Formats CM: u-f-truncate-2-2L dest, source, dlen, s, e 


Operands dest 

source 

len 

s, e 


The unsigned integer destination field. 

The floating-point source field. 

The length of the dest field. This must be non-negative and no 
greater than CM:♦maximum-integer-length*. 

The significand and exponent lengths for the source field. The 
total length of an operand in this format is s + e + 1. 


Overlap The fields dest and source must not overlap in any manner. 


Flags overflow-flag is set if the result cannot be represented in the dest field; other¬ 

wise it is cleared. 


Context This operation is conditional. The destination and flag may be altered only 
in processors whose context-flag is 1. 


Definition For every virtual processor k in the current-vp-set do 
if context-fla^k] — 1 then 

dest *— sign(source) x [] source |J 

if (overflow occurred in processor k) then overflow-flag[k ]■«— 1 


The source field, treated as a floating-point number, is rounded to the nearest integer in 
the direction of zero, and the result is stored into the dest field as an unsigned integer. 
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